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Preface to the second edition 


Since the publication of the first edition of The Description Logic Handbook 
in 2003, the interest in Description Logics (DL) has steadily increased. This 
applies both to the number of active DL researchers working on DL theory 
and implementations of reasoning services, and to the number of applications 
based on DL technology. One effect of this growing interest was that the 
first edition of the Handbook has gone through quite a number of reprints. 
Another effect is, of course, that in the last three years there have been 
interesting new developments in the three areas (theory, implementation, 
and applications) that the Handbook covers. Despite that, we feel that most 
chapters of the Handbook still provide a good introduction to the field and 
lay a solid foundation that enables the reader to understand and put into 
context the research articles describing results since 2003. For this reason, 
we have decided to leave most of the chapters unchanged. 

The principal exception is Chapter 14, which in the first edition was enti- 
tled “Digital Libraries and Web-Based Information Systems.” This chapter 
provided a selected history of the use of Description Logics in web-based in- 
formation systems, and the developments related to emerging web ontology 
languages such as OIL and DAML+OIL. Since the writing of this chap- 
ter, the new language OWL has been developed and recommended by the 
World Wide Web consortium as the standard web ontology language for 
the Semantic Web. In the second edition, Chapter 14, now co-authored by 
Peter Patel-Schneider, concentrates on OWL, which is reflected by its new 
title: “OWL: a Description-Logic-Based Ontology Language for the Seman- 
tic Web.” The chapter still briefly reviews some early efforts that combine 
Description Logics and the Web, including predecessors of OWL such as OIL 
and DAML+OIL. But then it goes on to describe OWL in some detail, in- 
cluding the various influences on its design, its relationship with RDFS, its 
syntax and semantics, and a range of tools and applications. 



xiv Preface to the second edition 

A minor change was made in Chapter 2. In fact, Proposition 2.9 in the 
first edition, which tried to give a syntactic criterion for the existence of 
fixpoint models of cyclic terminologies, turned out to be wrong. 1 In the 
second edition, it has been replaced by a correct criterion, now given in 
Proposition 2.10. The new material starts with (the new) Proposition 2.8 
and ends with Proposition 2.10. 

We are indebted to David Tranah, our editor at Cambridge University 
Press, for his patience during the preparation of this second edition, but 
also for the gentle pressure he exerted, without which this second edition 
would probably not have been completed. 

1 This problem was independently detected by several Ph.D. students, including Yuming Shen, 
Hongkai Liu, and Boontawee Suntisrivaraporn. Thank you for your careful reading! 



Preface 


Knowledge Representation is the field of Artificial Intelligence that focuses 
on the design of formalisms that are both epistemologically and computa- 
tionally adequate for expressing knowledge about a particular domain. One 
of the main lines of investigation has been concerned with the principle that 
knowledge should be represented by characterizing classes of objects and the 
relationships between them The organization of the classes used to describe 
a domain of interest is based on a hierarchical structure, which not only 
provides an effective and compact representation of information, but also 
allows the relevant reasoning tasks to be performed in a computationally 
effective way. 

The above principle drove the development of the first frame-based sys- 
tems and semantic networks in the 1970s. However, these systems were 
in general not formally defined and the associated reasoning tools were 
strongly dependent on the implementation strategies. A fundamental step 
towards a logic-based characterization of required formalisms was accom- 
plished through the work on the Kl-One system, which collected many of 
the ideas stemming from earlier semantic networks and frame-based systems, 
and provided a logical basis for interpreting objects, classes (or concepts), 
and relationships (or links, roles) between them. The first goal of such a log- 
ical reconstruction was the precise characterization of the set of constructs 
used to build class and link expressions. The second goal was to provide 
reasoning procedures that are sound and complete with respect to the se- 
mantics. The article “The tractability of subsumption in Frame-Based De- 
scription Languages” by Ron Brachman and Hector Levesque, presented at 
AAAI 1984, addressing the tradeoff between the expressiveness of Kl-One— 
like languages and the computational complexity of reasoning, is usually re- 
garded as the origin of research on Description Logics. 
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Subsequent research came under the label terminological systems to em- 
phasize the fact that classes and relationships were used to establish the 
basic terminology adopted in the modeled domain. Still later, the emphasis 
was on the set of concept forming constructs admitted in the language, giv- 
ing rise to the name concept languages. Recently, attention has moved closer 
to the properties of the underlying logical systems, and the term Description 
Logics has become popular. 

Research on Description Logics has covered theoretical aspects, implemen- 
tation of knowledge representation systems (modern frame-based systems) 
and the use of such systems to realize applications in several areas. This pat- 
tern of development is an example of one of the standard research method- 
ologies, as is recognized by the Artificial Intelligence community. The key 
element has been the very close interaction between theory and practice. On 
the one hand, there are various implemented systems based on Description 
Logics, offering a palette of description formalisms with differing expressive 
power, and which are employed in various application domains (such as nat- 
ural language processing, configuration of technical systems, databases). On 
the other hand, the formal and computational properties (like decidability, 
complexity) of various description formalisms have been studied in detail. 
These investigations are usually motivated by the use of certain constructors 
in systems or the need for these constructors in specific applications, and 
the results of such investigations have strongly influenced the design of new 
systems. 

The Description Logics research community currently consists of at least 
100 active researchers. In addition, other communities are now becoming 
interested in Description Logics, most notably the Databases community 
and, more recently, the Semantic Web one. After more than a decade of 
research on Description Logics there is a substantial body of work and well- 
established technical literature. However, there is no comprehensive presen- 
tation of the major achievements in the field, although survey papers have 
been published and workshop proceedings are available. 

Now, since 1989 a workshop dedicated to Description Logics has been 
held, initially every two years but annually from 1994. At the 1997 work- 
shop a Working Group was formed to develop a proposal for a book that 
would provide a systematic introduction to Description Logics, covering all 
aspects of the research in the field, namely: theory, implementation, and 
applications. Following the spirit that fostered this research, The Descrip- 
tion Logic Handbook would provide a thorough introduction to Descrip- 
tion Logics both for the more theoretically oriented reader interested in the 
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formal study of Description Logics and for the more practically oriented 
reader aiming at a principled usage of knowledge representation systems 
based on Description Logics. Although some refinements have been made 
to the initial proposal to embody recent developments in the field, the final 
structure of the Handbook reflects the original intentions. 

The Handbook is organized into three parts plus an initial chapter pro- 
viding a general introduction to the field. 

Part I addresses the theoretical work in Description Logics and includes 
five chapters. Chapter 2 introduces Description Logics as a formal language 
for representing knowledge and reasoning about it. Chapter 3 addresses the 
computational complexity of reasoning in several Description Logics. Chap- 
ter 4 explores the relationship with other representation formalisms, within 
and outside the field of Knowledge Representation. Chapter 5 covers ex- 
tensions of the basic Description Logics introduced in Chapter 2 by very 
expressive constructs that require advanced reasoning techniques. 

Chapter 6 considers extensions of Description Logics by representation 
features and non-standard inference problems not available in the basic 
framework. 

Part II is concerned with the implementation of knowledge representation 
systems based on Description Logics. Chapter 7 describes the features that 
need to be provided, in addition to the inference engine for a particular 
Description Logic, to build a knowledge representation system. Chapter 8 
reviews implemented knowledge representation systems based on Descrip- 
tion Logics that have played or play an important role in the field. Chapter 9 
describes the implementation of the reasoning services which form the core 
of Description Logic knowledge representation systems. 

Part III addresses the deployment of Description Logics in the design and 
implementation of fielded applications. Chapter 10 discusses the issues in- 
volved in the development of an ontology for some universe of discourse, 
which is to become a conceptual model or knowledge base represented and 
reasoned with using Description Logics. Chapter 11 presents applications of 
Description Logics in the area of software engineering. Chapter 12 introduces 
the problem of configuration and the largest and longest lived family of De- 
scription Logic-based configurators. Chapter 13 is concerned with the use of 
Description Logics in various kinds of applications in medical informatics - 
terminology, intelligent user interfaces, decision support and semantic index- 
ing, language technology, and systems integration. Chapter 14 reviews the 
applications of Description Logics in web-based information systems, and 
the more recent developments related to languages for the Semantic Web. 



xviii Preface 

Chapter 15 analyzes the uses of Description Logics for natural language 
processing to encode syntactic, semantic, and pragmatic elements needed 
to drive semantic interpretation and natural language generation processes. 
Chapter 16 surveys the major classes of application of Description Logics 
and their reasoning facilities to the issues of data management, including 
the expression of the conceptual domain model/ontology of the data source, 
the integration of multiple data sources, and the formulation and evaluation 
of queries. 

The syntax and semantics for Description Logics is summarized in an Ap- 
pendix, which has been used as a reference to unify the notation throughout 
the book. Finally, an extended, integrated bibliography is provided and, 
within each chapter, comprehensive guides through the relevant literature 
are given. 

The chapters are written by some of the most prominent researchers in 
the field, introducing the basic technical material before taking the reader to 
the current state of the subject. The chapters have been reviewed in a two 
step process, which involved two or three reviewers for each chapter. We have 
relied on the work of several external reviewers, selected both within the De- 
scription Logic community, and outside the field, to increase the readability 
for non experts. In addition, each chapter has been read also by authors of 
other chapters, to improve the overall coherence. 

As such, the book is conceived as a unique reference for the subject. Al- 
though not intended as a textbook, the Handbook can be used as a basis for 
specialized courses on Description Logics. In addition, some of the chapters 
can be used as teaching material in Knowledge Representation courses. The 
Handbook is also a comprehensive reference to the subject in more introduc- 
tory courses in the field of Artificial Intelligence. 

We want to acknowledge the contribution and help of several people. First 
of all, the authors, who have successfully accomplished the hardest task of 
writing the chapters, carefully addressing the reviewers’ comments as well 
as the issues raised by the effort in making the presentation and notation 
uniform. Second, we thank the reviewers for their precious work, which led 
to significant improvements in the final outcome. The external reviewers 
were: 

Premkumar T. Devanbu, 

Peter L. Elkin, 

Jerome Euzenat, 

Erich Gradel, 

Michael Gruninger, 
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Frank van Harmelen, 

Jana Koehler, 

Diane Litrnan, 

Robert M. MacGregor, 

Amedeo Napoli, 

Hans-Jiirgen Ohlbach, 

Marie-Christine Rousset, 

Nestor Rychtyckyj, 

Renate Schmidt, 

James G. Schmolze, 

Roberto Sebastiani, 

Michael Uschold, 

Moshe Y. Vardi, 

Grant Weddell, 

Robert A. Weida. 

A special thank you goes also to Christopher A. Welty who, besides serving 
as a reviewer, also coordinated the reviewing process for some of the chap- 
ters. Third, we express our gratitude to the Description Logics community as 
a whole (see also the Description Logics homepage at http://dl.kr.org/) 
for the outstanding research achievements and for applying the pressure that 
enabled us to complete the Handbook. Finally, we are indebted to Cambridge 
University Press, and, in particular, to David Tranah, for giving us the op- 
portunity to put the Handbook together and for the excellent support in the 
editing process. 
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An Introduction to Description Logics 


Daniele Nardi 
Ronald J. Brachman 


Abstract 

This introduction presents the main motivations for the development of De- 
scription Logics (DLs) as a formalism for representing knowledge, as well as 
some important basic notions underlying all systems that have been created 
in the DL tradition. In addition, we provide the reader with an overview of 
the entire book and some guidelines for reading it. 

We first address the relationship between Description Logics and earlier 
semantic network and frame systems, which represent the original heritage 
of the field. We delve into some of the key problems encountered with the 
older efforts. Subsequently, we introduce the basic features of DL languages 
and related reasoning techniques. 

DL languages are then viewed as the core of knowledge representation 
systems, considering both the structure of a DL knowledge base and its asso- 
ciated reasoning services. The development of some implemented knowledge 
representation systems based on Description Logics and the first applications 
built with such systems are then reviewed. 

Finally, we address the relationship of Description Logics to other fields of 
Computer Science. We also discuss some extensions of the basic representa- 
tion language machinery; these include features proposed for incorporation 
in the formalism that originally arose in implemented systems, and features 
proposed to cope with the needs of certain application domains. 


1.1 Introduction 

Research in the field of knowledge representation and reasoning is usually 
focused on methods for providing high-level descriptions of the world that 
can be effectively used to build intelligent applications. In this context, 
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“intelligent” refers to the ability of a system to find implicit consequences 
of its explicitly represented knowledge. Such systems are therefore charac- 
terized as knowledge-based systems. 

Approaches to knowledge representation developed in the 1970s - 
when the field enjoyed great popularity - are sometimes divided roughly 
into two categories: logic-based formalisms, which evolved out of the intu- 
ition that predicate calculus could be used unambiguously to capture facts 
about the world; and other, non-logic-based representations. The latter were 
often developed by building on more cognitive notions - for example, net- 
work structures and rule-based representations derived from experiments on 
recall from human memory and human execution of tasks like mathematical 
puzzle solving. Even though such approaches were often developed for spe- 
cific representational chores, the resulting formalisms were usually expected 
to serve in general use. In other words, the non-logical systems created from 
very specific lines of thinking (e.g., early production systems) evolved to 
be treated as general-purpose tools, expected to be applicable in different 
domains and to different types of problems. 

On the other hand, since first-order logic provides very powerful and gen- 
eral machinery, logic-based approaches were more general-purpose from the 
very start. In a logic-based approach, the representation language is usu- 
ally a variant of first-order predicate calculus, and reasoning amounts to 
verifying logical consequence. In the non-logical approaches, often based on 
the use of graphical interfaces, knowledge is represented by means of some 
ad hoc data structures, and reasoning is accomplished by similarly ad hoc 
procedures that manipulate the structures. Among these specialized repre- 
sentations we find semantic networks and frames. Semantic networks were 
developed after the work of Quillian [1967], with the goal of characteriz- 
ing by means of network-shaped cognitive structures the knowledge and the 
reasoning of the system. Similar goals were shared by later frame systems 
[Minsky, 1981], which rely on the notion of a “frame” as a prototype and on 
the capability of expressing relationships between frames. Although there are 
significant differences between semantic networks and frames, both in their 
motivating cognitive intuitions and in their features, they have a strong com- 
mon basis. In fact, they can both be regarded as network structures, where 
the structure of the network aims at representing sets of individuals and 
their relationships. Consequently, we use the term network-based structures 
to refer to the representation networks underlying semantic networks and 
frames (see [Lehmann, 1992] for a collection of papers concerning various 
families of network-based structures). 
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Owing to their more human-centered origins, the network-based systems 
were often considered more appealing and more effective from a practical 
viewpoint than the logical systems. Unfortunately, they were not fully satis- 
factory, because of their usual lack of precise semantic characterization. The 
end result of this was that every system behaved differently from the oth- 
ers, in many cases despite virtually identical-looking components and even 
identical relationship names. The question then arose as to how to provide 
semantics to representation structures, in particular to semantic networks 
and frames, which carried the intuition that, by exploiting the notion of hi- 
erarchical structure, one could gain both in terms of ease of representation 
and in terms of the efficiency of reasoning. 

One important step in this direction was the recognition that frames (at 
least their core features) could be given a semantics by relying on first-order 
logic [Hayes, 1979]. The basic elements of the representation are character- 
ized as unary predicates, denoting sets of individuals, and binary predicates, 
denoting relationships between individuals. However, such a characterization 
does not capture the constraints of semantic networks and frames with re- 
spect to logic. Indeed, although logic is the natural basis for specifying a 
meaning for these structures, it turns out that frames and semantic net- 
works (for the most part) did not require all the machinery of first-order 
logic, but could be regarded as fragments of it [Brachman and Levesque, 
1985]. In addition, different features of the representation language would 
lead to different fragments of first-order logic. The most important con- 
sequence of this fact is the recognition that the typical forms of reasoning 
used in structure-based representations could be accomplished by specialized 
reasoning techniques, without necessarily requiring first-order logic theorem 
provers. Moreover, reasoning in different fragments of first-order logic leads 
to computational problems of differing complexity. 

Subsequent to this realization, research in the area of Description Logics 
began under the label terminological systems , to emphasize that the repre- 
sentation language was used to establish the basic terminology adopted in 
the modeled domain. Later, the emphasis was on the set of concept-forming 
constructs admitted in the language, giving rise to the name concept lan- 
guages. In more recent years, after attention was further moved towards the 
properties of the underlying logical systems, the term Description Logics 
became popular. 

In this book we mainly use the term “Description Logics” for the represen- 
tation systems, but often use the word “concept” to refer to the expressions 
of a DL language, denoting sets of individuals, and the word “terminology” 
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to denote a (hierarchical) structure built to provide an intensional represen- 
tation of the domain of interest. 

Research on Description Logics has covered theoretical underpinnings as 
well as implementation of knowledge representation systems and the de- 
velopment of applications in several areas. This kind of development has 
been quite successful. The key element has been the methodology of re- 
search, based on a very close interaction between theory and practice. On 
the one hand, there are various implemented systems based on Descrip- 
tion Logics, which offer a palette of description formalisms with differing 
expressive power, and which are employed in various application domains 
(such as natural language processing, configuration of technical products, 
or databases). On the other hand, the formal and computational proper- 
ties of reasoning (like decidability and complexity) of various description 
formalisms have been investigated in detail. The investigations are usually 
motivated by the use of certain constructors in implemented systems or by 
the need for these constructors in specific applications - and the results have 
influenced the design of new systems. 

This book is meant to provide a thorough introduction to Description 
Logics, covering all the above-mentioned aspects of DL research - namely 
theory, implementation, and applications. Consequently, the book is divided 
into three parts: 


• Part I introduces the theoretical foundations of Description Logics, addressing 
some of the most recent developments in theoretical research in the area; 

• Part II focuses on the implementation of knowledge representation systems based 
on Description Logics, describing the basic functionality of a DL system, survey- 
ing the most influential knowledge representation systems based on Description 
Logics, and addressing specialized implementation techniques; 

• Part III addresses the use of Description Logics and of DL-based systems in the 
design of several applications of practical interest. 


In the remainder of this introductory chapter, we review the main steps 
in the development of Description Logics, and introduce the main issues 
that are dealt with later in the book, providing pointers for its reading. In 
particular, in the next section we address the origins of Description Logics 
and then we review knowledge representation systems based on Description 
Logics, the main applications developed with Description Logics, the main 
extensions to the basic DL framework, and relationships with other fields of 
Computer Science. 
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1.2 From networks to Description Logics 

In this section we begin by recalling approaches to representing knowledge 
that were developed before research on Description Logics began (i.e., se- 
mantic networks and frames). We then provide a very brief introduction to 
the basic elements of these approaches, based on Tarski-style semantics. Fi- 
nally, we discuss the importance of computational analyses of the reasoning 
methods developed for Description Logics, a major ingredient of research in 
this field. 


1.2.1 Network-based representation structures 

In order to provide some intuition about the ideas behind representations 
of knowledge in network form, we here speak in terms of a generic network, 
avoiding references to any particular system. The elements of a network are 
nodes and links. Typically, nodes are used to characterize concepts, i.e., sets 
or classes of individual objects, and links are used to characterize relation- 
ships among them. In some cases, more complex relationships are themselves 
represented as nodes; these are carefully distinguished from nodes represent- 
ing concepts. In addition, concepts can have simple properties, often called 
attributes, which are typically attached to the corresponding nodes. Finally, 
in many of the early networks both individual objects and concepts were 
represented by nodes. Here, however, we restrict our attention to knowl- 
edge about concepts and their relationships, deferring for now treatment of 
knowledge about specific individuals. 

Let us consider a simple example, whose pictorial representation is given 
in Figure 1.1, which represents knowledge concerning persons, parents, chil- 
dren, etc. The structure in the figure is also referred to as a terminology , 
and it is indeed meant to represent the generality or specificity of the con- 
cepts involved. For example the link between Mother and Parent says that 
“mothers are parents”; this is sometimes called an “IS- A” relationship. 

The IS-A relationship defines a hierarchy over the concepts and provides 
the basis for the “inheritance of properties” : when a concept is more specific 
than some other concept, it inherits the properties of the more general one. 
For example, if a person has an age, then a woman has an age, too. This 
is the typical setting of the so-called (monotonic) inheritance networks (see 
[Brachman, 1979]). 

A characteristic feature of Description Logics is their ability to represent 
other kinds of relationships that can hold between concepts, beyond IS- 
A relationships. For example, in Figure 1.1, which follows the notation of 
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[Brachman and Schmolze, 1985], the concept of Parent has a property that 
is usually called a “role” , expressed by a link from the concept to a node for 
the role labeled hasChild. The role has what is called a “value restriction”, 
denoted by the label v/r, which expresses a limitation on the range of types of 
objects that can fill that role. In addition, the node has a number restriction 
expressed as (1,NIL), where the first number is a lower bound on the number 
of children and the second element is the upper bound, and NIL denotes 
infinity. Overall, the representation of the concept of Parent here can be 
read as “A parent is a person having at least one child, and all of his/her 
children are persons.” 

Relationships of this kind are inherited from concepts to their subcon- 
cepts. For example, the concept Mother, i.e. , a female parent, is a more 
specific descendant of both the concepts Female and Parent, and as a result 
inherits from Parent the link to Person through the role hasChild; in other 
words, Mother inherits the restriction on its hasChild role from Parent. 

Observe that there may be implicit relationships between concepts. For 
example, if we define Woman as the concept of a female person, it is the case 
that every Mother is a Woman. It is the task of the knowledge representation 
system to find implicit relationships such as these (many are more complex 
than this one) . Typically, such inferences have been characterized in terms of 
properties of the network. In this case one might observe that both Mother 
and Woman are connected to both Female and Person, but the path from 
Mother to Person includes a node Parent, which is more specific then Person, 
thus enabling us to conclude that Mother is more specific than Person. 

However, the more complex the relationships established among concepts, 
the more difficult it becomes to give a precise characterization of what kind 
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of relationships can be computed, and how this can be done without failing 
to recognize some of the relationships or without providing wrong answers. 


1.2.2 A logical account of network-based representation 

structures 

Building on the above ideas, a number of systems were implemented and 
used in many kinds of applications. As a result, the need emerged for a 
precise characterization of the meaning of the structures used in the rep- 
resentations and of the set of inferences that could be drawn from those 
structures. 

A precise characterization of the meaning of a network can be given by 
defining a language for the elements of the structure and by providing an 
interpretation for the strings of that language. While the syntax may have 
different flavors in different settings, the semantics is typically given as a 
Tarski-style semantics. 

For the syntax we introduce a kind of abstract language, which resembles 
other logical formalisms. The basic step of the construction is provided by 
two disjoint alphabets of symbols that are used to denote atomic concepts , 
designated by unary predicate symbols, and atomic roles , designated by bi- 
nary predicate symbols; the latter are used to express relationships between 
concepts. 

Terms are then built from the basic symbols using several kinds of con- 
structors. For example, intersection of concepts , which is denoted C n D, 
is used to restrict the set of individuals under consideration to those that 
belong to both C and D. Notice that, in the syntax of Description Logics, 
concept expressions are variable- free. In fact, a concept expression denotes 
the set of all individuals satisfying the properties specified in the expres- 
sion. Therefore, C n D can be regarded as the first-order logic sentence, 
C(x) A D(x), where the variable ranges over all individuals in the interpre- 
tation domain and C(x) is true for those individuals that belong to the 
concept C. 

In this book, we will present other syntactic notations that are more 
closely related to the concrete syntax adopted by implemented DL systems, 
and which are more suitable for the development of applications. One ex- 
ample of concrete syntax proposed in [Patel-Schneider and Swartout, 1993] 
is based on a LlSP-like notation, where the concept of female persons, for 
example, is denoted by (and Person Female). 

The key characteristic features of Description Logics reside in the con- 
structs for establishing relationships between concepts. The basic ones are 
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value restrictions. For example, a value restriction, written VR.C, requires 
that all the individuals that are in the relationship R with the concept being 
described belong to the concept C (technically, it is all individuals that are 
in the relationship R with an individual described by the concept in question 
that are themselves describable as C’ s). 

As for the semantics, concepts are given a set-theoretic interpretation: 
a concept is interpreted as a set of individuals, and roles are interpreted 
as sets of pairs of individuals. The domain of interpretation can be chosen 
arbitrarily, and it can be infinite. The non-finiteness of the domain and 
the open-world assumption are distinguishing features of Description Logics 
with respect to the modeling languages developed in the study of databases 
(see Chapters 4 and 16). 

Atomic concepts are thus interpreted as subsets of the intepretation do- 
main, while the semantics of the other constructs is then specified by defining 
the set of individuals denoted by each construct. For example, the concept 
C n D is the set of individuals obtained by intersecting the sets of indi- 
viduals denoted by C and D, respectively. Similarly, the interpretation of 
Vi ?. C is the set of individuals that are in the relationship R with individuals 
belonging to the set denoted by the concept C. 

As an example, let us suppose that Female, Person, and Woman are atomic 
concepts and that hasChild and hasFemaleRelative are atomic roles. Using the 
operators intersection , union and complement of concepts, interpreted as set 
operations, we can describe the concept of “persons that are not female” and 
the concept of “individuals that are female or male” by the expressions 

Person n -Female and Female U Male. 

It is worth mentioning that intersection, union, and complement of concepts 
have been also referred to as concept conjunction , concept disjunction and 
concept negation , respectively, to emphasize the relationship to logic. 

Let us now turn our attention to role restrictions by looking first at quan- 
tified role restrictions and, subsequently, at what we call “number restric- 
tions”. Most languages provide (full) existential quantification and value 
restriction that allow one to describe, for example, the concept of “individ- 
uals having a female child” as dhasChild. Female, and to describe the concept 
of “individuals all of whose children are female” by the concept expression 
VhasChild. Female. In order to distinguish the function of each concept in the 
relationship, the individual object that corresponds to the second argument 
of the role viewed as a binary predicate is called a role filler. In the above 
expressions, which describe the properties of parents having female children, 
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individual objects belonging to the concept Female are the fillers of the role 

hasChild. 

Existential quantification and value restrictions are thus meant to charac- 
terize relationships between concepts. In fact, the role link between Parent 
and Person in Figure 1.1 can be expressed by the concept expression 

3hasChild. Person n VhasChild. Person. 

Such an expression therefore characterizes the concept of Parent as the set 
of individuals having at least one filler of the role hasChild belonging to the 
concept Person; moreover, every filler of the role hasChild must be a person. 

Finally, notice that in quantified role restrictions the variable being quan- 
tified is not explicitly mentioned. The corresponding sentence in first-order 
logic is Vy.R(x,y ) D C(y), where x is again a free variable ranging over the 
interpretation domain. 

Another important kind of role restriction is given by number restrictions, 
which restrict the cardinality of the sets of role fillers. For instance, the 
concept 

3 hasChild) n 2 hasFemaleRelative) 

represents the concept of “individuals having at least three children and at 
most two female relatives”. Number restrictions are sometimes viewed as 
a distinguishing feature of Description Logics, although one can find some 
similar constructs in some database modeling languages (notably Entity- 
Relationship models). 

Beyond the constructs to form concept expressions, Description Logics 
provide constructs for roles, which can, for example, establish role hierar- 
chies. However, the use of role expressions is generally limited to expressing 
relationships between concepts. 

Intersection of roles is an example of a role- forming construct. Intuitively, 
hasChild n hasFemaleRelative yields the role “has-daughter” , so that the con- 
cept expression 

Woman n ^ 2 (hasChild n hasFemaleRelative) 

denotes the concept of “a woman having at most 2 daughters” . 

A more comprehensive view of the basic definitions of DL languages will 
be given in Chapter 2. 


1.2.3 Reasoning 

The basic inference on concept expressions in Description Logics is subsump- 
tion, typically written as CCD. Determining subsumption is the problem 
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of checking whether the concept denoted by D (the subsumer ) is considered 
more general than the one denoted by C (the subsumee). In other words, 
subsumption checks whether the first concept always denotes a subset of the 
set denoted by the second one. 

For example, one might be interested in knowing whether Woman C 
Mother. In order to verify this kind of relationship one has in general to 
take into account the relationships defined in the terminology. As we ex- 
plain in the next section, under appropriate restrictions, one can embody 
such knowledge directly in concept expressions, thus making subsumption 
over concept expressions the basic reasoning task. Another typical infer- 
ence on concept expressions is concept satisfiability , which is the problem 
of checking whether a concept expression does not necessarily denote the 
empty concept. In fact, concept satisfiability is a special case of subsump- 
tion, with the subsumer being the empty concept, meaning that a concept 
is not satisfiable. 

Although the meaning of concepts had already been specified with a log- 
ical semantics, the design of inference procedures in Description Logics was 
influenced for a long time by the tradition of semantic networks, where con- 
cepts were viewed as nodes and roles as links in a network. Subsumption 
between concept expressions was recognized as the key inference and the ba- 
sic idea of the earliest subsumption algorithms was to transform two input 
concepts into labeled graphs and test whether one could be embedded into 
the other; the embedded graph would correspond to the more general con- 
cept (the subsumer) [Lipkis, 1982]. This method is called structural compar- 
ison, and the relation between concepts being computed is called structural 
subsumption. However, a careful analysis of the algorithms for structural 
subsumption shows that they are sound , but not always complete in terms 
of the logical semantics: whenever they return “yes” the answer is correct, 
but when they report “no” the answer may be incorrect. In other words, 
structural subsumption is in general weaker than logical subsumption. 

The need for complete subsumption algorithms is motivated by the fact 
that in the usage of knowledge representation systems it is often necessary 
to have a guarantee that the system has not failed in verifying subsumption. 
Consequently, new algorithms for computing subsumption have been devised 
that are no longer based on a network representation, and these can be 
proven to be complete. Such algorithms have been developed by specializing 
classical settings for deductive reasoning to the DL subsets of first-order 
logics, as done for tableau calculi by Schmidt- Schaufi and Srnolka [1991], 
and also by more specialized methods. 
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In the paper “The tractability of subsumption in frame-based description 
languages”, Brachman and Levesque [1984] argued that there is a tradeoff 
between the expressiveness of a representation language and the difficulty of 
reasoning over the representations built using that language. In other words, 
the more expressive the language, the harder the reasoning. They also pro- 
vided a first example of this tradeoff by analyzing the language F C~ (Frame 
Language), which included intersection of concepts, value restrictions and a 
simple form of existential quantification. They showed that for such a lan- 
guage the subsumption problem could be solved in polynomial time, while 
adding a construct called role restriction to the language makes subsumption 
a coNP-hard problem (the extended language was called FC). 

The paper by Brachman and Levesque introduced at least two new ideas: 

1. “efficiency of reasoning” over knowledge structures can be studied using the 
tools of computational complexity theory; 

2. different combinations of constructs can give rise to languages with different 
computational properties. 

An immediate consequence of the above observations is that one can study 
formally and methodically the tradeoff between the computational com- 
plexity of reasoning and the expressiveness of the language, which it- 
self is defined in terms of the constructs that are admitted in the lan- 
guage. After the initial paper, a number of results on this tradeoff for 
concept languages were obtained (see Chapters 2 and 3), and these re- 
sults allow us to draw a fairly complete picture of the complexity of 
reasoning for a wide class of concept languages. Moreover, the problem 
of finding the optimal tradeoff, namely the most expressive extensions 
of FC~ with respect to a given set of constructs that still keep sub- 
sumption polynomial, has been studied extensively [Donini et al, 1991b; 
1999]. 

One of the assumptions underlying this line of research is to use worst- 
case complexity as a measure of the efficiency of reasoning in Description 
Logics (and more generally in knowledge representation formalisms). Such 
an assumption has sometimes been criticized (see for example [Doyle and 
Patil, 1991]) as not adequately characterizing system performance or ac- 
counting for more average-case behavior. While this observation suggests 
that computational complexity alone may not be sufficient for addressing 
performance issues, research on the computational complexity of reasoning 
in Description Logics has most definitely led to a much deeper understand- 
ing of the problems arising in implementing reasoning tools. Let us briefly 
address some of the contributions of this body of work. 
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First of all, the study of the computational complexity of reasoning in 
Description Logics has led to a clear understanding of the properties of the 
language constructs and their interaction. This is not only valuable from a 
theoretical viewpoint, but gives insight to the designer of deduction proce- 
dures, with clear indications of the language constructs and their combina- 
tions that are difficult to deal with, as well as general methods to cope with 
them. 

Secondly, the complexity results have been obtained by exploiting a gen- 
eral technique for satisfiability checking in concept languages, which relies on 
a form of tableau calculus [Schmidt-Schaufi and Srnolka, 1991]. Such a tech- 
nique has proved extremely useful for studying both the correctness and the 
complexity of the algorithms. More specifically, it provides an algorithmic 
framework that is parametric with respect to the language constructs. The 
algorithms for concept satisfiability and subsumption obtained in this way 
have also led directly to practical implementations by application of clever 
control strategies and optimization techniques. The most recent knowledge 
representation systems based on Description Logics adopt tableau calculi 
[Horrocks, 1998b]. 

Thirdly, the analysis of pathological cases in this formal framework has 
led to the discovery of incompleteness in the algorithms developed for im- 
plemented systems. This has also consequently proven useful in the defi- 
nition of suitable test sets for verifying implementations. For example, the 
comparison of implemented systems (see for example [Baader et al., 1992b; 
Heinsohn et al., 1992]) has greatly benefitted from the results of the com- 
plexity analysis. 

The basic reasoning techniques for Description Logics are presented in 
Chapter 2, while a detailed analysis of the complexity of reasoning problems 
in several languages is developed in Chapter 3. 

After the tradeoff between expressiveness and tractability of reasoning 
was thoroughly analyzed and the range of applicability of the corresponding 
inference techniques had been experimented with, there was a shift of focus 
in the theoretical research on reasoning in Description Logics. Interest grew 
in relating Description Logics to the modeling languages used in database 
management. In addition, the discovery of strict relationships with expres- 
sive modal logics stimulated the study of so-called very expressive Descrip- 
tion Logics. These languages, besides admitting very general mechanisms for 
defining concepts (for example cyclic definitions, addressed in the next sec- 
tion), provide a richer set of concept-forming constructs and constructs for 
forming complex role expressions. For these languages, the expressiveness is 
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great enough that the new challenge became enriching the language while 
retaining the decidability of reasoning. It is worth pointing out that this new 
direction of theoretical research was accompanied by a corresponding shift 
in the implementation of knowledge representation systems based on very 
expressive DL languages. The study of reasoning methods for very expressive 
Description Logics is addressed in Chapter 5. 


1.3 Knowledge representation in Description Logics 

In the previous section a basic representation language for Description Log- 
ics was introduced along with some key associated reasoning techniques. Our 
goal now is to illustrate how Description Logics can be useful in the design 
of knowledge-based applications, that is to say, how a DL language is used 
in a knowledge representation system that provides a language for defining 
a knowledge base and tools to carry out inferences over it. The realization of 
knowledge systems involves two primary aspects. The first consists in pro- 
viding a precise characterization of a knowledge base; this involves precisely 
characterizing the type of knowledge to be specified to the system as well 
as clearly defining the reasoning services the system needs to provide - the 
kind of questions that the system should be able to answer. The second 
aspect consists in providing a rich development environment where users 
can benefit from different services that can make their interaction with the 
system more effective. In this section we address the logical structure of the 
knowledge base, while the design of systems and tools for the development 
of applications is addressed in the next section. 

One of the products of some important historical efforts to provide pre- 
cise characterizations of the behavior of semantic networks and frames was 
a functional approach to knowledge representation [Levesque, 1984], The 
idea was to give a precise specification of the functionality to be provided 
by a knowledge base and, specifically, of the inferences performed by the 
knowledge base - independent of any implementation. In practice, the func- 
tional description of a reasoning system is productively specified through a 
so-called “Tell&Ask” interface. Such an interface specifies operations that 
enable knowledge base construction (Tell operations) and operations that 
allow one to get information out of the knowledge base (Ask operations). In 
the following we shall adopt this view for characterizing both the definition 
of a DL knowledge base and the deductive services it provides. 

Within a knowledge base one can see a clear distinction between inten- 
sional knowledge , or general knowledge about the problem domain, and ex- 
tensional knowledge , which is specific to a particular problem. A typical DL 
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knowledge base analogously comprises two components - a TBox and an 
ABox. The TBox contains intensional knowledge in the form of a termi- 
nology (hence the term “TBox”, but “taxonomy” could be used as well) 
and is built through declarations that describe general properties of con- 
cepts. Because of the nature of the subsumption relationships among the 
concepts that constitute the terminology, TBoxes are usually thought of 
as having a lattice-like structure; this mathematical structure is entailed 
by the subsumption relationship - it has nothing to do with any imple- 
mentation. The ABox contains extensional knowledge - also called asser- 
tional knowledge (hence the term “ABox”) - knowledge that is specific to 
the individuals of the domain of discourse. Intensional knowledge is usu- 
ally thought not to change - to be “timeless”, in a way - and extensional 
knowledge is usually thought to be contingent, or dependent on a single 
set of circumstances, and therefore subject to occasional or even constant 
change. 

In the rest of the section we present a basic Tell&Ask interface by ana- 
lyzing the TBox and the ABox of a DL knowledge base. 


1.3.1 The TBox 

One key element of a DL knowledge base is given by the operations used to 
build the terminology. Such operations are directly related to the forms and 
the meaning of the declarations allowed in the TBox. 

The basic form of declaration in a TBox is a concept definition, that is, the 
definition of a new concept in terms of other previously defined concepts. 
For example, a woman can be defined as a female person by writing this 
declaration: 


Woman = Person n Female. 

Such a declaration is usually interpreted as a logical equivalence, which 
amounts to providing both sufficient and necessary conditions for classifying 
an individual as a woman. This form of definition is much stronger than the 
ones used in other kinds of representations of knowledge, which typically 
impose only necessary conditions; the strength of this kind of declaration 
is usually considered a characteristic feature of DL knowledge bases. In DL 
knowledge bases, therefore, a terminology is constituted by a set of concept 
definitions of the above form. 

However, there are some important common assumptions usually made 
about DL terminologies: 
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• Only one definition for a concept name is allowed. 

• Definitions are acyclic in the sense that concepts are neither defined in terms of 

themselves nor in terms of other concepts that indirectly refer to them. 

This kind of restriction is common to many DL knowledge bases and implies 
that every defined concept can be expanded in a unique way into a com- 
plex expression containing only atomic concepts by replacing every defined 
concept with the right-hand side of its definition. 

Nebel [1990b] showed that even simple expansion of definitions like this 
gives rise to an unavoidable source of complexity; in practice, however, defi- 
nitions that inordinately increase the complexity of reasoning do not seem to 
occur. Under these assumptions the computational complexity of inferences 
can be studied by abstracting from the terminology and by considering all 
given concepts as fully expanded expressions. Therefore, much of the study 
of reasoning methods in Description Logics has been focused on concept 
expressions and, more specifically, as discussed in the previous section, on 
subsumption, which can be considered the basic reasoning service for the 
TBox. 

In particular, the basic task in constructing a terminology is classification, 
which amounts to placing a new concept expression in the proper place 
in a taxonomic hierarchy of concepts. Classification can be accomplished 
by verifying the subsumption relation between each defined concept in the 
hierarchy and the new concept expression. The placement of the concept 
will be in between the most specific concepts that subsume the new concept 
and the most general concepts that the new concept subsumes. 

More general settings for concept definitions have recently received some 
attention, deriving from attempts to establish formal relationships between 
Description Logics and other formalisms and from attempts to satisfy a need 
for increased expressive power. In particular, the admission of cyclic defini- 
tions has led to different semantic interpretations of the declarations, known 
as greatest/least fixpoint, and descriptive semantics. Although it has been 
argued that different semantics may be adopted depending on the target ap- 
plication, the more commonly adopted one is descriptive semantics, which 
simply requires that all the declarations be satisfied in the interpretation. 
Moreover, by dropping the requirement that on the left-hand side of a defi- 
nition there can only be an atomic concept name, one can consider so-called 
(general) inclusion axioms of the form 

C\ZD 

where C and D are arbitrary concept expressions. Notice that a concept 
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definition can be expressed by two general inclusions. As a result of several 
theoretical studies concerning both the decidability of and implementation 
techniques for cyclic TBoxes, the most recent DL systems admit rather pow- 
erful constructs for defining concepts. 

The basic deduction service for such TBoxes can be viewed as logical 
implication and it amounts to verifying whether a generic relationship (for 
example a subsumption relationship between two concept expressions) is a 
logical consequence of the declarations in the TBox. The issues arising in 
the semantic characterization of cyclic TBoxes are dealt with in Chapter 2, 
while techniques for reasoning in cyclic TBoxes are addressed in Chapter 2 
and in Chapter 5, where very expressive Description Logics are presented. 


1.3.2 The ABox 

The ABox contains extensional knowledge about the domain of interest, that 
is, assertions about individuals, usually called membership assertions. For 
example, 

Female n Person(ANNA) 

states that the individual ANNA is a female person. Given the above defini- 
tion of woman, one can derive from this assertion that ANNA is an instance 
of the concept Woman. Similarly, 

hasChild(ANNA, JACOPO) 

specifies that ANNA has JACOPO as a child. Assertions of the first kind are 
also called concept assertions, while assertions of the second kind are also 
called role assertions. 

As illustrated by these examples, in the ABox one can typically specify 
knowledge in the form of concept assertions and role assertions. In concept 
assertions general concept expressions are typically allowed, while role as- 
sertions, where the role is not a primitive role but a role expression, are 
typically not allowed, being treated in the case of very expressive languages 
only. 

The basic reasoning task in an ABox is instance checking , which verifies 
whether a given individual is an instance of (belongs to) a specified concept. 
Although other reasoning services are usually considered and employed, they 
can be defined in terms of instance checking. Among them we find knowledge 
base consistency, which amounts to verifying whether every concept in the 
knowledge base admits at least one individual; realization, which finds the 
most specific concept an individual object is an instance of; and retrieval, 
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which finds the individuals in the knowledge base that are instances of a 
given concept. These can all be accomplished by means of instance checking. 

The presence of individuals in a knowledge base makes reasoning more 
complex from a computational viewpoint [Donini et al. , 1994b], and may 
require significant extensions of some TBox reasoning techniques. Reasoning 
in the ABox is addressed in Chapter 3. 

It is worth emphasizing that, although we have separated out for conve- 
nience the services for the ABox, when the TBox cannot be dealt with by 
means of the simple substitution mechanism used for acyclic TBoxes, the 
reasoning services may have to take into account all of the knowledge base 
including both the TBox and the ABox, and the corresponding reasoning 
problems become more complex. A full setting including general TBox and 
ABox is addressed in Chapter 5, where very expressive Description Logics 
are discussed. 

More general languages for defining ABoxes have also been considered. 
Knowledge representation systems providing a powerful logical language 
for the ABox and a DL language for the TBox are often considered hy- 
brid reasoning systems, since completely different knowledge representation 
languages may be used to specify the knowledge in the different compo- 
nents. Hybrid reasoning systems were popular in the 1980s (see for example 
[Brachman et al, 1985]); lately, the topic has regained attention [Levy and 
Rousset, 1997; Donini et al, 1998b], focusing on knowledge bases with a 
DL component for concept definitions and a logic-programming component 
for assertions about individuals. Sound and complete inference methods for 
hybrid knowledge bases become difficult to devise whenever there is a strict 
interaction between the knowledge components. 


1.4 From theory to practice: Description Logic systems 

A direct practical result of research on knowledge representation has been 
the development of tools for the construction of knowledge-based applica- 
tions. As already noted, research on Description Logics has been character- 
ized by a tight connection between theoretical results and implementation 
of systems. This has been achieved by maintaining a very close relationship 
between theoreticians, system implementors and users of knowledge repre- 
sentation systems based on Description Logics (DL-KRSs). The results of 
work on reasoning algorithms and their complexity have influenced the de- 
sign of systems, and research on reasoning algorithms has itself been focused 
by a careful analysis of the capabilities and the limitations of implemented 
systems. In this section we first sketch the functionality of some knowledge 
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representation systems and, subsequently, discuss the evolution of DL-KRSs. 
The reader can find a deeper treatment of the first topic in Chapter 7, while 
a survey of knowledge representation systems based on Description Logics 
is provided in Chapter 8. Chapter 9 is devoted to more specialized imple- 
mentation and optimization techniques. 


1.4-1 The design of knowledge representation systems based on 

Description Logics 

In order to appreciate the difficulties of implementing and maintaining a 
knowledge representation system, it is necessary to consider that in the us- 
age of a knowledge representation system, the reasoning service is really only 
one aspect of a complex system, one which may even be hidden from the 
final user. The user, before getting to “push the reasoning button” , has to 
model the domain of interest, and input knowledge into the system. Further, 
in many cases, a simple yes/no answer is of little use, so a simplistic imple- 
mentation of the Tell&Ask paradigm may be inadequate. As a consequence, 
the path one follows to get from the identification of a suitable knowledge 
representation system to the design of applications based on it is a complex 
and demanding one (see for example [Brachman, 1992]). In the case of De- 
scription Logics, this is especially true if the goal is to devise a system to be 
used by users who are not DL experts and who need to obtain a working sys- 
tem as quickly as possible. In the 1980s, when frame-based systems (such as, 
for example, Ivee [Fikes and Kehler, 1985]; see [Karp, 1992] for an overview) 
had reached the strength of commercial products, the burden on a user of 
moving to the more modern DL-KRSs had to be kept small. Consequently, 
a stream of research addressed important aspects of the pragmatic usability 
of DL systems. This issue was especially relevant for those systems aiming 
at limiting the expressiveness of the language, but providing the user with 
sound, complete and efficient reasoning services. The issue of embedding a 
DL language within an environment suitable for application development is 
further addressed in Chapter 7. 

In recent years, we might add, useful DL systems have often come as in- 
ternal components of larger environments whose interfaces could completely 
hide the DL language and its core reasoning services. Systems like Imacs 
[Brachman et al., 1993] and Prose [Wright et al, 1993] were quite successful 
in classifying data and configuring products, respectively, without the need 
for any user to understand the details of the DL representation language 
(Classic) they were built upon. 
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Nowadays, applications for gathering information from the World Wide 
Web, where the interface can be specifically designed to support the retrieval 
of such information, also hide the knowledge representation and reasoning 
component. In addition, some data modeling tools, where the system pro- 
vides a more conventional interface, can provide additional facilities based 
on the capability of reasoning about models with a DL inference engine. The 
possible settings for taking advantage of Description Logics as components 
of larger systems are discussed in Part III; more specifically, Chapter 14 
presents Web applications and Chapter 15 natural language applications, 
while the reasoning capabilities of Description Logics in database applica- 
tions are addressed in Chapter 16. 

1.4-2 Knowledge representation systems based on 
Description Logics 

The history of knowledge representation is covered in the literature in nu- 
merous ways (see for example [Woods and Schmolze, 1992; Rich, 1991; 
Baader et al, 1992b]). Here we identify three generations of systems, high- 
lighting their historical evolution rather than their specific functionality. We 
shall characterize them as Pre-DL systems, DL systems and Current Gen- 
eration DL systems. Detailed references to implemented systems are given 
in Chapter 8. 


1-4 -2.1 Pre- Description Logic systems 

The ancestor of DL systems is Kl-One [Brachman and Schmolze, 1985], 
which signaled the transition from semantic networks to more well-founded 
terminological (description) logics. The influence of Kl-One was profound 
and it is considered the root of the entire family of languages [Woods and 
Schmolze, 1990]. 

Semantic networks were introduced around 1966 as a representation for 
the concepts underlying English words, and became a popular type of frame- 
work for representing a wide variety of concepts in AI applications. Impor- 
tant and commonsensical ideas evolved in this work, from named nodes and 
links for representing concepts and relationships, to hierarchical networks 
with inheritance of properties, to the notion of “instantiation” of a concept 
by an individual object. But semantic network systems were fraught with 
problems, including vagueness and inconsistency in the meaning of various 
constructs, and the lack of a level of structure on which to base application- 
independent inference procedures. In his PhD thesis [Brachman, 1977a] 
and subsequent work (e.g., see [Brachman, 1979]), Brachman addressed 
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representation at what he called an “epistemological” , or knowledge- 
structuring level. This led to a set of primitives for structuring knowledge 
that was less application- and world-knowledge-dependent than “semantic” 
representations (like those for processing natural language case structures), 
yet richer than the impoverished set of primitives available in strictly logical 
languages. The main result of this work was a new knowledge representa- 
tion framework whose primitive elements allowed cleaner, more application- 
independent representations than prior network formalisms. In the late 
1970s, Brachman and his colleagues explored the utility and implications 
of this kind of framework in the Kl-One system. 

Kl-One introduced most of the key notions explored in the extensive 
work on Description Logics that followed. These included, for example, 
the notions of concepts and roles and how they were to be interrelated; 
the important ideas of “value restriction” and “number restriction”, which 
modified the use of roles in the definitions of concepts; and the crucial 
inferences of subsumption and classification. It also sowed the seeds for the 
later distinction between the TBox and ABox and a host of other significant 
notions that greatly influenced subsequent work. Kl-One also was the 
initial example of the substantial interplay between theory and practice 
that characterizes the history of Description Logics. It was influenced 
by work in logic and philosophy (and in turn itself influenced work in 
philosophy and psychology), and significant care was taken in its design 
to allow it to be consistent and semantically sound. But it was also used 
in multiple applications, covering intelligent information presentation and 
natural language understanding, among other things. 

Most of the focus of the original work on Kl-One was on the representa- 
tion of and reasoning with concepts, with only a small amount of attention 
paid to reasoning with individual objects. The first descendants of Kl-One 
were focused on architectures providing a clear distinction between a pow- 
erful logic-based (or rule-based) component and a specialized terminological 
component. These systems came to be referred to as hybrid systems. A major 
research issue was the integration of the two components to provide unified 
reasoning services over the whole knowledge base. 

1. 4-2.2 Description Logic systems 

The earliest “pre-DL” systems derived directly from Kl-One, which, while 
itself a direct result of formal analysis of the shortcomings of semantic net- 
works, was mainly about the implementation of a viable classification algo- 
rithm and the data structures to adequately represent concepts. DL systems, 
per se, which followed as the next generation, were more derived from a wave 
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of theoretical research on terminological logics that resulted from examina- 
tion of Kl-One and some other early systems. This work was initiated in 
roughly 1984, inspired by a paper by Brachman and Levesque [Brachman 
and Levesque, 1984] on the formal complexity of reasoning in Description 
Logics. Subsequent results on the tradeoff between the expressiveness of a 
DL language and the complexity of reasoning with it, and more generally, 
the identification of the sources of complexity in DL systems, showed that a 
careful selection of language constructs was needed and that the reasoning 
services provided by the system are deeply influenced by the set of constructs 
provided to the user. We can thus characterize three different approaches to 
the implementation of reasoning services. The first can be referred to as lim- 
ited + complete, and includes systems that are designed by restricting the set 
of constructs in such a way that subsumption would be computed efficiently, 
possibly in polynomial time. The Classic system [Brachman et al, 1991] is 
the most significant example of this kind. The second approach can be de- 
noted as expressive+incomplete , since the idea is to provide both an expres- 
sive language and efficient reasoning. The drawback is, however, that reason- 
ing algorithms turn out to be incomplete in these systems. Notable examples 
of this kind of system are Loom [MacGregor and Bates, 1987], and Back 
[Nebel and von Luck, 1988] . After some of the sources of incompleteness were 
discovered, often by identifying the constructs - or, more precisely, combi- 
nations of constructs - that would require an exponential algorithm to pre- 
serve the completeness of reasoning, systems with complete reasoning algo- 
rithms were designed. Systems of this sort (see for example Kris [Baader and 
Hollunder, 1991a]) are therefore characterized as expressive+complete ; they 
were not as efficient as those following the other approaches, but they pro- 
vided a testbed for the implementation of reasoning techniques developed in 
the theoretical investigations, and they played an important role in stimulat- 
ing comparison and benchmarking with other systems [Heinsohn et al., 1992; 
Baader et al, 1992b]. 

1-4-2.3 Current generation Description Logic systems 
In the current generation of DL-KRSs, the need for complete algorithms for 
expressive languages has been the focus of attention. The expressiveness of 
the DL language required for reasoning on data models and semistructured 
data has contributed to the identification of the most important extensions 
for practical applications. 

The design of complete algorithms for expressive Description Logic has led 
to significant extensions of tableau-based techniques and to the introduction 
of several optimization techniques, partly borrowed from theorem proving 
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and partly specifically developed for Description Logics. Recent systems that 
allow for expressive DL languages, while providing efficient implementations 
of reasoning are FaCT [Horrocks, 1998b] and Racer [Haarslev and Moller, 
2001e]. 

This research has also been influenced by newly discovered relationships 
between Description Logics and other logics, leading to exchanging bench- 
marks and experimental comparisons with other deduction systems. 

The techniques that have been used in the implementation of very expres- 
sive Description Logics are addressed in detail in Chapter 9. 


1.5 Applications developed with Description Logic systems 

The third component in the picture of the development of Description Log- 
ics is the implementation of applications in different domains. Some of the 
applications created over the years may have only reached the level of pro- 
totype, but many of them have the completeness of industrial systems and 
have been deployed in production use. 

A critical element in the development of applications based on Descrip- 
tion Logics is the usability of the knowledge representation system. We have 
already emphasized that building a tool to be used in the design and im- 
plementation of knowledge-based applications requires significant work to 
make it suitable for interactive development, explanation and debugging, 
interface implementation, and so on. In addition, here we focus on the effec- 
tiveness of Description Logics as a modeling language. A modeling language 
should have intuitive semantics and the syntax must help convey the in- 
tended meaning. To this end, a somewhat different syntax than we have 
seen so far, closer to that of natural language, has often been adopted, and 
graphical interfaces that provide an operational view of the process of knowl- 
edge base construction have been developed. The issues arising in modeling 
application domains using Description Logics are dealt with in Chapter 10, 
and will be briefly addressed in the next subsection. 

It is natural to expect that some classes of applications share similari- 
ties both in methodological patterns and in the design of specific structures 
or reasoning capabilities. Consequently, we identify several application do- 
mains in Subsection 1.5.2; these include software engineering, configuration, 
medicine, and digital libraries and Web-based information systems. 

In Subsection 1.5.3 we consider several application areas where Descrip- 
tion Logics play a major role; these include natural language processing 
and database management, where Description Logics can be used in several 
ways. 
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When addressing the design of applications it is also worth pointing out 
that there has been significant evolution in the way Description Logics have 
been used within complex applications. In particular, the DL-centered view 
that underlies the earliest generation of systems, wherein an application was 
developed in a single environment (the one provided by the DL system) , was 
characterized by very loose interaction, if any, between the DL system and 
other applications. Later, an approach that viewed the Description Logic 
more as a component became evident; in this view the DL system acts as 
a component of a larger environment, typically leaving out functions, such 
those for data management, that are more effectively implemented by other 
technologies. The architecture where the component view is taken requires 
the definition of a clear interface between the components, possibly adopting 
different modeling languages, but focusing on Description Logics for the 
implementation of the reasoning services that can add powerful capabilities 
to the application. Obviously, the choice between the above architectural 
views depends upon the needs of the application at hand. 

Finally, we have already stressed that research in Description Logics has 
benefited from tight interaction between language designers and developers 
of DL-KRSs. Thus, another major impact on the development of DL re- 
search was provided by the implementation of applications using DL-KRSs. 
Indeed, work on DL applications not only demonstrated the effectiveness 
of Description Logics and of DL-KRSs, but also provided mutual feedback 
within the DL community concerning the weaknesses of both the represen- 
tation language and the features of an implemented DL-KRS. 


1.5.1 Modeling with Description Logics 

In order for designers to be able to use Description Logics to model their 
application domains, it is important for the DL constructs to be easily un- 
derstandable; this helps facilitate the construction of convenient to use yet 
effective tools. To this end, the abstract notation that we have previously 
introduced and that is nowadays commonly used in the DL community is 
not fully satisfactory. 

As already mentioned, there are at least two major alternatives for in- 
creasing the usability of Description Logics as a modeling language: 

1. providing a syntax that resembles more closely natural language; 

2. implementing interfaces where the user can specify the representation structures 
through graphical operations. 


Before addressing the above two possibilities, one brief remark is in order. 
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While alternative ways of specifying knowledge, such as natural-language- 
style syntax, can be more appealing to the user, one should remember that 
Description Logics in part arose from a need to respond to the inadequacy - 
the lack of a formal semantic basis - of early semantic networks and frame 
systems. Those early systems often relied on an assumption of intuitive 
readings of natural-language-like constructs or graphical structures, which 
in the end made them unsatisfactory. Therefore, we need to keep in mind 
always the correspondence of the language used by the user and the abstract 
DL syntax, and consequently correspondences with the formal semantics 
should always be clear and available. 

The option of a more readable syntax has been pursued in the major- 
ity of DL-KRSs. In particular, we refer to the concrete syntax proposed 
in [Patel-Schneider and Swartout, 1993], which is based on a LlSP-like no- 
tation, where, for example, the concept of a female person is denoted by 
(and Person Female). Similarly, the concept VhasChild. Female would be 
written (all hasChild Female). In addition, there are shorthand expres- 
sions, such as (the hasChild Female), which indicates the existence of a 
unique female child, and can be phrased using qualified existential restric- 
tion and number restriction. In Chapter 10 this kind of syntax is discussed 
in detail and the possible sources for ambiguities in the natural language 
reading of the constructs are discussed. 

The second option for providing the user with a concrete syntax is to rely 
on a graphical interface. Starting with the Kl-One system, this possibility 
has been pursued by introducing a graphical notation for the representation 
of concepts and roles, as well as their relationships. More recently, Web- 
based interfaces for Description Logics have been proposed [Welty, 1996a]; 
in addition, an XML standard has been proposed [Bechhofer et al, 1999; 
Euzenat, 2001], which is suitable not only for data interchange, but also for 
providing full-fledged Web interfaces to DL-KRSs or applications embodying 
them as components. 

The modeling language is the vehicle for the expression of the modeling 
notions that are provided to the designers. Modeling in Description Logics 
requires the designer to specify the concepts of the domain of discourse and 
characterize their relationships to other concepts and to specific individuals. 
Concepts can be regarded as classes of individuals and Description Logics 
as an object-centered modeling language, since they allow one to introduce 
individuals (objects) and explicitly define their properties, as well as to ex- 
press relationships among them. Concept definition, which provides for both 
necessary and sufficient conditions, is a characteristic feature of Description 
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Logics. The basic relationship between concepts is subsumption, which al- 
lows one to capture various kinds of subclassing mechanisms; however, other 
kinds of relationships can be modeled, such as grouping, materialization, and 
part- whole aggregation. 

The model of a domain in Description Logics is embedded in a knowledge 
base. We have already addressed the TBox-ABox characterization of the 
knowledge base. We recall that the roles of TBox and ABox were motivated 
by the need to distinguish general knowledge about the domain of interest 
from specific knowledge about individuals characterizing a specific world or 
situation under consideration. Besides the TBox-ABox, other mechanisms 
for organizing a knowledge base such as contexts and views have been intro- 
duced in Description Logics. The use of the modeling notions provided by 
Description Logics and the organization of knowledge bases are addressed 
in greater detail in Chapter 10. 

Finally, we recall that Description Logics as modeling languages overlap 
to a large extent with other modeling languages developed in fields such 
as programming languages and database management. While we shall focus 
on this relationship later, we recall here that, when compared to modeling 
languages developed in other fields, the characteristic feature of Description 
Logics is in the reasoning capabilities that are associated with them. In 
other words, we believe that, while modeling has general significance, the 
capability of exploiting the description of the model to draw conclusions 
about the problem at hand is a particular advantage of modeling using 
Description Logics. 


1.5.2 Application domains 

Description Logics have been used (and are being used) in the implementa- 
tion of many systems that demonstrate their practical effectiveness. Some 
of these systems have found their way into production use, despite the fact 
that there was no real commercial platform that could be used for developing 
them. 


1.5. 2.1 Software engineering 

Software engineering was one of the first application domains for Desciption 
Logics undertaken at AT&T, where the Classic system was developed. 
The basic idea was to use a Description Logic to implement a software 
information system, i.e. , a system that would support the software developer 
by helping him or her in finding out information about a large software 
system. 
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More specifically, it was found that the information of interest for soft- 
ware development was a combination of knowledge about the domain of 
the application and code-specific information. However, while the structure 
of the code can be determined automatically, the connection between code 
elements and domain concepts needs to be specified by the user. 

One of the most novel applications of Description Logics is the LaSSiE 
system [Devanbu et al, 1991], which allowed users to incrementally build 
a taxonomy of concepts relating domain notions to the code implementing 
them. The system could thereafter provide useful information in response 
to user queries concerning the code, such as, for example “the function to 
generate a dial tone” . By exploiting the description of the domain, the infor- 
mation retrieval capabilities of the system went significantly beyond those of 
the standard tools used for software development. The LaSSiE system had 
considerable success but ultimately stumbled because of the difficulty of 
maintenance of the knowledge base, given the constantly changing nature of 
industrial software. Both the ideas of a software information system and the 
usage of Description Logics survived that particular application and have 
been subsequently used in other systems. The usage of Description Logics 
in applications for software engineering is described in Chapter 11. 

1.5. 2. 2 Configuration 

One very successful domain for knowledge-based applications built using 
Description Logics is configuration , which includes applications that support 
the design of complex systems created by combining multiple components. 

The configuration task amounts to finding a proper set of components 
that can be suitably connected in order to implement a system that meets a 
given specification. For example, choosing computer components in order to 
build a home PC is a relatively simple configuration task. When the number, 
the type, and the connectivity of the components grow, the configuration 
task can become rather complex. In particular, computer configuration has 
been among the application fields of the first expert systems and can thus 
be viewed as a standard application domain for knowledge-based systems. 
Configuration tasks arise in many industrial domains, such as telecommuni- 
cations, the automotive industry, building construction, etc. 

DL-based knowledge representation systems meet the requirements for 
the development of configuration applications. In particular, they enable 
the object-oriented modeling of system components, which combines 
powerfully with the ability to reason from incomplete specifications and 
to automatically detect inconsistencies. Using Description Logics one 
can exploit the ability to classify the components and organize them 
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within a taxonomy. In addition a DL-based approach supports incremental 
specification and modularity. Applications for configuration tasks require 
at least two features that were not in the original core of DL-KRSs: the 
representation of rules (together with a rule propagation mechanism), and 
the ability to provide explanations. However, extensions with so-called 
“active rules” are now very common in DL-KRSs, and a precise semantic 
account is given in Chapter 2; significant work on explanation capabilities of 
DL-KRSs has been developed in connection with the design of configuration 
applications [McGuinness and Borgida, 1995]. Chapter 12 is devoted 
to the applications developed in Description Logics for configuration 
tasks. 


1.5. 2. 3 Medicine 

Medicine is also a domain where expert systems have been developed since 
the 1980s; however, the complexity of the medical domain calls for a variety 
of uses for a DL-KRS. In practice, decision support for medical diagnosis 
is only one of the tasks in need of automation. One focus has been on the 
construction and maintenance of very large ontologies of medical knowl- 
edge, the subject of some large government initiatives. The need to deal 
with large-scale knowledge bases (hundreds of thousands of concepts) led to 
the development of specialized systems, such as Galen [Rector et al, 1993], 
while the requirement for standardization arising from the need to deal with 
several sources of information led to the adoption of the DL standard lan- 
guage Krss [Patel-Schneider and Swartout, 1993] in projects like Snomed 
[Spackman et al., 1997]. 

In order to cope with the scalability of the knowledge base, the DL lan- 
guage adopted in these applications is often limited to a few basic constructs 
and the knowledge base turns out to be rather shallow, that is to say the 
taxonomy does not have very many levels of subconcepts below the top con- 
cepts. Nonetheless, there are several advanced language features that would 
be very useful in the representation of medical knowledge, such as, for ex- 
ample, specific support for PART-OF hierarchies (see Chapter 10), as well 
as defaults and modalities to capture lack of knowledge (see Chapter 6). 

Obviously, since medical applications most often must be used by doctors, 
a formal logical language is not well-suited; therefore special attention is 
given to the design of the user interface; in particular, natural language 
processing (see Chapter 15) is important both in the construction of the 
ontology and in the operational interfaces. 

Further, the DL component of a medical application usually operates 
within a larger information system, comprising several sources of informa- 
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tion, which need to be integrated in order to provide a coherent view of the 
available data (on this topic see Chapter 16). 

Finally, an important issue that arises in the medical domain is the man- 
agement of ontologies, which not only requires common tools for project 
management, such as versioning systems, but also tools to support knowl- 
edge acquisition and re-use (on this topic see Chapter 8). 

The use of Description Logics specifically in the design of medical appli- 
cations is addressed in Chapter 13. 

1.5. 2. 4 Digital libraries and Web-based information systems 

The relationship between semantic networks and the linked structures im- 
plied by hypertext has motivated the development of DL applications for 
representing bibliographic information and for supporting classification and 
retrieval in digital libraries [Welty and Jenkins, 2000]. These applications 
have proven the effectiveness of Description Logics for representing the tax- 
onomies that are commonly used in library classification schemes, and they 
have shown the advantage of subsumption reasoning for classifying and re- 
trieving information. In these instances, a number of technical questions, 
mostly related to the use of individuals in the taxonomy, have motivated 
the use of more expressive Description Logics. 

The possibility of viewing the World Wide Web as a semantic net- 
work has been considered since the advent of the Web itself. Even in 
the early days of the Web, thought was given to the potential benefits of 
enabling programs to handle not only simple unlabeled navigation struc- 
tures, but also the information content of Web pages. The goal was to 
build systems for querying the Web “semantically”, allowing the user to 
pose queries of the Web as if it were a database, roughly speaking. Based 
on the relationship between Description Logics and semantic networks, 
a number of proposals were developed that used Description Logics to 
model Web structures, allowing the exploitation of DL reasoning capabil- 
ities in the acquisition and management of information [Kirk et al, 1995; 
De Rosa et al. , 1998]. 

More recently, there have been significant efforts based on the use of 
markup languages to capture the information content of Web structures. 
The relationship between Description Logics and markup languages, such as 
XML, has been precisely characterized [Calvanese et al., 1999d], thus iden- 
tifying DL language features for representing XML documents. Moreover, 
interest in the standardization of knowledge representation mechanisms for 
enabling knowledge exchange has led to the development of OWL (Ontol- 
ogy Web Language) [Bechhofer et al., 2004; Patel-Schneider et al., 2004; 



An Introduction to Description Logics 


29 


Smith et al, 2004] which is a product of the research on Description Log- 
ics, and the theoretical investigations on the computability and complexity 
of reasoning in Description Logics [Horrocks et al., 2003]. The use of De- 
scription Logics in the design of digital libraries and Web applications is 
addressed in Chapter 14, with specific discussion on OWL and its standard- 
ization process. 


1.5. 2. 5 Other application domains 

The above list of application domains, while presenting some of the most 
relevant applications designed with DL-KRSs, is far from complete. There 
are many other domains that have been addressed by the DL community. 
Among the application areas that have resorted to Description Logics for 
useful functions are planning and data mining. 

With respect to planning, many knowledge-based applications rely on the 
services of a planning component. While Description Logics do not provide 
such a component themselves, they have been used to implement several 
general-purpose planning systems. The basic idea is to represent plans and 
actions, as well as their constituent elements, as concepts. The system can 
thus maintain a taxonomy of plan types and provide several reasoning ser- 
vices, such as plan recognition, plan subsumption, plan retrieval, and plan 
refinement. Two examples of planning components developed in a DL-KRS 
are Clasp [Yen et al, 1991b], developed on top of Classic, and Expect 
[Swartout and Gil, 1996], developed on top of Loom. In addition, the in- 
tegration of Description Logics and other formalisms, such as constraint 
networks, has been proposed [Weida and Litrnan, 1992], Planning systems 
based on Description Logics have been used in many application domains 
to support planning services in conjunction with a taxonomic representa- 
tion of the domain knowledge. Such application domains include, among 
others, software engineering, medicine, campaign planning, and information 
integration. 

It is worth mentioning that Description Logics have also been used to 
represent dynamic systems and to automatically generate plans based on 
such representations. However, in such cases the use of Description Logics 
is limited to the formalization of properties that characterize the states of 
the system, while plan generation is achieved through the use of a rule 
propagation mechanism [De Giacomo et al, 1999]. Such use of Description 
Logics is inspired by the correspondence between Description Logics and 
Dynamic Logics described in Chapter 5. 

Description Logics have also been used in data mining applications, where 
their inferences can help the process of analyzing large amounts of data. In 
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this kind of application, DL structures can represent views, and DL systems 
can be used to store and classify such views. The classification mechanism 
can help in discovering interesting classes of items in the data. We address 
this type of application briefly in the next subsection on database manage- 
ment. 


1.5.3 Application areas 

From the beginning Description Logics have been considered general- 
purpose languages for knowledge representation and reasoning, and 
therefore suited for many applications. In particular, they were considered 
especially effective for those domains where the knowledge could be easily 
organized along a hierarchical structure, based on the “IS-A” relationship. 
The ability to represent and reason about taxonomies in Description 
Logics has motivated their use as a modeling language in the design and 
maintenance of large, hierarchically structured bodies of knowledge as 
well as their adoption as the representation language for formal ontologies 
[Welty and Guarino, 2001]. 

We now briefly look at some other research areas that have a more gen- 
eral relationship with Description Logics. Such a relationship exists either 
because Description Logics are viewed as a basic representation language, 
as in the case of natural language processing, or because they can be used 
in a variety of ways in concert with the main technology of the area, as in 
the field of database management. 

1.5. 3.1 Natural language 

Description Logics, as well as semantic networks and frames, originally had 
natural language processing as a major field for application (see for example 
[Brachman, 1979]). In particular, when work on Description Logics began, 
not only was a large part of the DL community working on natural language 
applications, but Description Logics also bore a strong similarity to other 
formalisms used in natural language work, such as Feature Logics [Nebel 
and Srnolka, 1991]. 

The use of Description Logics in natural language processing is mainly 
concerned with the representation of semantic knowledge that can be used 
to convey meanings of sentences. Such knowledge is typically concerned with 
the meaning of words (the lexicon), and with context, that is, a representa- 
tion of the situation and domain of discourse. 

A significant body of work has been devoted to the problem of dis- 
ambiguating different syntactic readings of sentences, based on semantic 
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knowledge, a process called semantic interpretation. Moreover, semantic 
knowledge expressed in Description Logics has also been used to support 
natural language generation. 

Since the domain of discourse for a natural language application can be 
arbitrarily broad, work on natural language has also involved the construc- 
tion of ontologies [Welty and Guarino, 2001]. In addition, the expressiveness 
of natural language has led also to investigations concerning extensions of 
Description Logics, such as default reasoning (see Chapter 6). 

Several large projects for natural language processing based on the use 
of Description Logics have been undertaken, some reaching the level of in- 
dustrially deployed applications. They are referenced in Chapter 15, where 
the role of Description Logics in natural language processing is addressed in 
more detail. 


1.5. 3. 2 Database management 

The relationship between Description Logics and databases is rather strong. 
In fact, there is often the need to build systems where both a DL-KRS and 
a DataBase Management System (DBMS) are present. DBMSs deal with 
persistence of data and with the management of large amounts of it, while 
a DL-KRS manages intensional knowledge, typically keeping the knowledge 
base in memory (possibly including assertions about individuals that corre- 
spond to data) . While some of the applications created with DL-KRSs have 
developed ad hoc solutions to the problem of dealing with large amounts 
of persistent data, in a complex application domain it is very likely that a 
DL-KRSs and a DBMS would both be components of a larger system, and 
they would work together. 

In addition, Description Logics provide a formal framework that has been 
shown to be rather close to the languages used in semantic data modeling, 
such as the Entity-Relationship model [Calvanese et al, 1998g] . Description 
Logics are equipped with reasoning tools that can bring to the conceptual 
modeling phase significant advantages, as compared with traditional lan- 
guages, whose role is limited to modeling. For instance, by using concept 
consistency one can verify at design time whether an entity can have at 
least one instance, thus clearly saving all the difficulties arising from dis- 
covering such a situation when the database is being populated [Borgida, 
1995]. 

A second dimension of the enhancement of DBMSs with Description Log- 
ics involves the query language. By expressing the queries to a database in 
a Description Logic one gains the ability to classify them and therefore to 
deal with issues such as query processing and optimization. However, the 
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basic DL machinery needs to be extended in order to deal with conjunctive 
queries; otherwise DL expressiveness with respect to queries is rather lim- 
ited. In addition, Description Logics can be used to express constraints and 
intensional answers to queries. 

A corollary of the relationship between Description Logics and DBMS 
query languages is the utility of Description Logics in reasoning with and 
about views. In the Imacs system [Brachman et al, 1993], the Classic 
language was used as a “lens” [Brachman, 1994] with which data in a con- 
ventional relational database could be viewed. The interface to the data was 
made significantly more appropriate for a data analyst, and views that were 
found to be productive could be saved; in fact, they were saved in a tax- 
onomy and could be classified with respect to one another. In a sense, this 
allows the schema to be viewed and queried explicitly, something normally 
not available when using a raw DBMS directly. 

A more recent use of Description Logics is concerned with so-called “semi- 
structured” data models [Calvanese et al, 1998c], which are being proposed 
in order to overcome the difficulties in treating data that are not structured 
in a relational form, such as data on the Web, data in spreadsheets, etc. In 
this area Description Logics are sufficiently expressive to represent models 
and languages that are being used in practice, and they can offer significant 
advantages over other approaches because of the reasoning services they 
provide. 

Another problem that has recently increased the applicability of Descrip- 
tion Logics is information integration. As already remarked, data are nowa- 
days available in large quantities and from a variety of sources. Information 
integration is the task of providing a unique coherent view of the data stored 
in the sources available. In order to create such a view, a proper relationship 
needs to be established between the data in the sources and the unified view 
of the data. Description Logics not only have the expressiveness needed in 
order to model the data in the sources, but their reasoning services can help 
in the selection of the sources that are relevant for a query of interest, as 
well as to specify the extraction process [Calvanese et al, 2001c]. 

The uses of Description Logics with databases are addressed in more detail 
in Chapter 16. 


1.6 Extensions of Description Logics 

In this section we look at several types of extensions that have been proposed 
for Description Logics; these are addressed in more detail in Chapter 6. Such 
extensions are generally motivated by needs arising in applications. Unfor- 
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tunately, some extended features in implemented DL-KRSs were created 
without precise, formal accounts; in some other cases, such accounts have 
been provided using a formal framework that is not restricted to first-order 
logic. 

A first group of extensions has the purpose of adding to DL languages 
some representational features that were common in frame systems or that 
are relevant for certain classes of applications. Such extensions provide a rep- 
resentation of some novel epistemological notions and address the reasoning 
problems that arise in the extended framework. 

Extensions of a second sort are concerned with reasoning services that are 
useful in the development of knowledge bases but are typically not provided 
by DL-KRSs. The implementation of such services relies on additional infer- 
ence techniques that are considered non-standard, because they go beyond 
the basic reasoning services provided by DL-KRSs. 

Below we first address the extensions of the knowledge representation 
framework and then non-standard inferences. 


1.6.1 Language extensions 

Some of the research associated with language extensions has investigated 
the semantics of the proposed extensions, but often the emphasis is only 
on finding reasoning procedures for the extended languages. Within these 
language extensions we find constructs for non-monotonic, epistenric, and 
temporal reasoning, and constructs for representing belief and uncertain and 
vague knowledge. In addition some constructs address reasoning in concrete 
domains. 


1.6. 1.1 Non-monotonic reasoning 

When frame-based systems began to be formally characterized as fragments 
of first-order logic, it became clear that those frame-based systems as well 
as some DL-KRSs that were used in practice occasionally provided the user 
with constructs that could not be given a precise semantic characterization 
within the framework of first-order logic. Notable among the problematic 
constructs were those associated with the notion of defaults, which over 
time have been extensively studied in the field of non-monotonic reasoning 
[Brachman, 1985]. 

While one of the problems arising in semantic networks was the oft-cited 
so-called “Nixon diamond” [Reiter and Criscuolo, 1981], a whole line of 
research in non-monotonic reasoning was developed in trying to character- 
ize the system behavior by studying structural properties of networks. For 
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example, the general property that “birds fly” might not be inherited by a 
penguin, because a rule that penguins do not fly would give rise to an arc in 
the network that would block the default inference. But as soon as the net- 
work becomes relatively complex (see for example [Touretzky et al, 1991]), 
we can see that attempts to provide semantic characterization in terms of 
network structure are inadequate. 

Another approach that has been pursued in the formalization of non- 
monotonic reasoning in semantic networks is based on the use of default 
logic [Reiter, 1980; Etherington, 1987; Nado and Fikes, 1987]. Following a 
similar approach is the treatment of defaults in DL-based systems [Baader 
and Hollunder, 1995a], where formal tools borrowed from work on non- 
monotonic reasoning have been adapted to the framework of Description 
Logics. Such adaptation is non-trivial, however, because Description Logics 
are not, in general, propositional languages. 


1.6. 1.2 Modal representation of knowledge and belief 
Modal logics have been widely studied to model a variety of features that 
in first-order logic would require the application of special constraints on 
certain elements of the formalization. For example, the notions of know- 
ing something or believing that some sentence is true can be captured by 
introducing modal operators, which characterize properties that sentences 
have. 

For instance the assertion 


B( Married (ANNA)) 

states a fact explicitly concerning the system’s beliefs (the system believes 
that Anna is married), rather than asserting the truth of something about 
the world being modeled (the system could believe something to be true 
without firm knowledge about its truth in the world). 

In general, by introducing a modal operator one gains the ability to model 
properties like knowledge, belief, time-dependence, obligation, and so on. 
On the one hand, extensions of Description Logics with modal operators 
can be viewed very much like the corresponding modal extensions of first- 
order logic. In particular, the semantic issues arising in the interpretation 
of quantified modal sentences (i.e., sentences with modal operators appear- 
ing inside the scope of quantifiers) are the same. On the other hand, the 
syntactic restrictions that are suited to a DL language lead to formalisms 
whose expressiveness and reasoning problems inherit some of the features 
of a specialized DL language. Extensions of Description Logics with modal 
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operators including those for representing knowledge and belief are discussed 
in [Baader and Ohlbach, 1995]. 

1.6. 1.3 Epistemic reasoning 

It is not sufficient to provide a semantics for defaults to obtain a full se- 
mantic account of frame-based systems. Frame-based systems have included 
procedural rules as well as other forms of closure and epistemic reasoning 
that need to be covered by the semantics as well as by the reasoning algo- 
rithms. In particular, if one looks at the most widely-used systems based on 
Description Logics, such features are still present, possibly in new flavors, 
while their semantics is given informally and the consequences of reasoning 
sometimes not adequately explained. 

Among the non-first-order features that are used in the practice of 
knowledge-based applications in both DL-based and frame-based systems 
we point out these: 

• procedural rules (also called trigger rules), which are normally described as if-then 
statements and are used to infer new facts about known individuals; 

• default rules, which enable default reasoning in inheritance hierarchies; 

• role closure, which limits the reasoning involving role restrictions to the individ- 
uals explicitly in the knowledge base; 

• integrity constraints, which provide consistency restrictions on admissible knowl- 
edge bases. 

In Chapter 6, among other approaches an epistemic extension of Descrip- 
tion Logics with a modal operator is addressed. In the resulting formalism 
[Donini et al, 1998a] one can express epistemic queries and, by admitting a 
simple form of epistemic sentences in the knowledge base, one can formal- 
ize the aforementioned procedural rules. This characterization of procedural 
rules in terms of an epistemic operator has been widely accepted in the DL 
community and is thus also included in Chapter 2. The approach has been 
further extended to what have been called Autoepistemic Description Logics 
(ADLs) [Donini et al., 1997b, 2002], where it is combined with default rea- 
soning. This combination is achieved by relying on the non-monotonic modal 
logic MKNF [Lifschitz, 1991], thus introducing a second modal operator in- 
terpreted as autoepistemic assumption. The features mentioned above can 
be uniformly treated as epistemic sentences in the knowledge base, without 
the need to give them special status as in the case of procedural rules, 
defaults, and epistemic constraints on the knowledge base. This expres- 
siveness does not come without making reasoning more difficult. An exten- 
sion of the reasoning methods available for deduction in the propositional 
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formalizations of non-monotonic reasoning to the fragment of first-order 
logic corresponding to Description Logics has nonetheless been shown to be 
decidable. 


1.6. 1.4 Temporal reasoning 

One notion that is often required in the formalization of application do- 
mains is time. Temporal extensions of Description Logics have been treated 
as a special kind of modal extension. The first proposal for handling time 
in a DL framework [Schmiedel, 1990] was originated in the context of the 
DL system Back. Later, following the standard approaches in the represen- 
tation of time, both interval-based and point-based approaches have been 
studied, specifically focusing on the decidability and complexity of the rea- 
soning problems (see [Art ale and Franconi, 2001] for a survey the temporal 
extensions of Description Logics). 

Time intervals can also be treated as a form of concrete domain (see 
below) . 


1.6. 1.5 Representation of uncertain and vague knowledge 

Another aspect of knowledge that is sometimes useful in representing and 
reasoning about application domains is uncertainty. As in other knowledge 
representation frameworks there are several approaches to the representation 
of uncertain knowledge in Description Logics. Two of them, namely proba- 
bilistic logic and fuzzy logic, have been proposed in the context of Descrip- 
tion Logics. In the case of probabilistic Description Logics [Heinsohn, 1994; 
Jaeger, 1994] the knowledge about the domain is expressed in terms of prob- 
abilistic terminological axioms, which allow one to represent statistical in- 
formation about the domain, and in terms of probabilistic assertions, which 
specify the degree of belief of asserted properties. The reasoning tasks aim at 
finding the probability bounds for subsumption relations and assertions. A 
more recent line of work tries to combine Description Logics with Bayesian 
networks. 

In the case of fuzzy Description Logics [Yen, 1991] the goal is to character- 
ize notions that cannot be properly defined with a “crisp” numerical bound. 
For example, the concept of living near Rome cannot be always defined with 
a crisp boundary on the map, but must be represented with a membership 
or degree function, which expresses closeness to the city in a continuous 
way. 

Proposed approaches to fuzzy Description Logics not only define the se- 
mantics of assertions in terms of fuzzy sets, but also introduce new operators 
to express notions like “mostly” , “very” , etc. Reasoning algorithms are also 
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provided for computing fuzzy subsumption within the framework of tableau- 
based methods. 


1.6. 1.6 Concrete domains 

One of the limitations of basic Description Logics is related to the difficulty of 
integrating knowledge (and, consequently, performing reasoning) of specific 
domains, such as numbers or strings, which are needed in many applications. 
For example, in order to model the concept of a young person it seems rather 
natural to introduce the (functional) role age and to use a concrete value (or 
range of values) in the definition of the concept. In addition, one would like 
to be able to conclude that a person of school age is also a young person. 
Such a conclusion might require the use of properties of numbers to establish 
that the expected subsumption relation holds. 

While for some time such extensions were designed in ad hoc ways, in 
[Baader and Hanschke, 1991a] a general method was established for integrat- 
ing knowledge about concrete domains within a DL language. If a domain 
can be properly formalized, it is shown that the tableau-based reasoning 
technique can be suitably extended to handle the reasoning services in the 
extended language. 

Concrete domains include not only data types such as numerical types, 
but also more elaborate domains, such as tuples of the relational calculus, 
spatial regions, or time intervals. 


1.6.2 Additional reasoning services 

Non-standard inference tasks can serve a variety of purposes, among them 
support in building and maintaining the knowledge base, as well as in ob- 
taining information about the knowledge represented in it. 

Among the more useful non-standard inference tasks in Description Logics 
we find the computation of the least common subsumer and the most specific 
concept, matching/unification, and concept rewriting. 

1.6. 2.1 Least common subsumer and most specific concept 
The least common subsumer ( les ) of a set of concepts is the minimal con- 
cept that subsumes all of them. The minimality condition implies there is 
no other concept that subsumes all the concepts in the set and is less gen- 
eral (subsumed by) the lcs. This notion was first studied in [Cohen et ai, 
1992] and it has subsequently been used for several tasks: inductive learn- 
ing of concept description from examples; knowledge base vivification (as a 
way to represent disjunction in languages that do not admit it); and in the 
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bottom-up construction of DL knowledge bases (starting from instances of 
the concepts). 

The notion of lcs is closely related to that of most specific concept ( msc ) 
of an individual, i.e., the least concept description that the individual is 
an instance of, given the assertions in the knowledge base; the minimality 
condition is specified as before. More generally, one can define the msc of a 
set of assertions about individuals as the lcs of the msc associated with each 
individual. Based on the computation of the msc of a set of assertions about 
individuals one can incrementally construct a knowledge base [Baader and 
Kiisters, 1999]. 

It interesting to observe that the techniques that have been proposed to 
compute the lcs and nrcs rely on compact representations of concept expres- 
sions, which are built either following the structural subsumption approach, 
or through the definition of a well-suited normal form. 


1.6. 2. 2 Unification and matching 

Another tool to support the construction and maintenance of DL knowledge 
bases that goes beyond the standard inference services provided by DL-KRSs 
is the unification of concepts. 

Concept unification [Baader and Narendran, 1998] is an operation that 
can be regarded as weakening the equivalence between two concept expres- 
sions. More precisely, two concept expressions unify if one can find a sub- 
stitution of concept variables into concept expressions such that the result 
of applying the substitution gives equivalent concepts. The intuition is that, 
in order to find possible overlaps between concept definitions, one can treat 
certain concept names as variables and discover, via unification, that two 
concepts (possibly independently defined by distinct knowledge designers) 
are in fact equivalent. The knowledge base can consequently be simplified 
by introducing a single definition of the unifiable concepts. 

As usual, matching is defined as a special case of unification, where vari- 
ables occur only in one of the two concept expressions. In addition, in the 
framework of Description Logics, one can define matching and unification 
based on the subsumption relation instead of equivalence [Baader et al, 
1999a]. 

As with other non-standard inferences, the computation of matching and 
unification relies on the use of specialized representations for concept expres- 
sions, and it has been shown to be decidable for rather simple Description 
Logics. 
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1.6. 2. 3 Concept rewriting 

Finally, there has been a significant body of work on the problem of concept 
rewriting. Given a concept expressed in a source language, concept rewriting 
amounts to finding a concept, possibly expressed in a target language, which 
is related to the given concept according to equivalence, subsumption, or 
some other relation. 

In order to specify the rewriting, one can provide a suitable set of con- 
straints between concepts in the source language and concepts in the target 
language. Concept rewriting can be applied to the translation of concepts 
from one knowledge base to another, or in the reformulation of concepts 
during the process of knowledge base construction and maintenance. 

In addition, concept rewriting has been addressed in the context of the 
rewriting of queries using views, in database management (see also Chap- 
ter 16), and has recently been investigated in the framework of information 
integration. In this setting, one can apply concept rewriting techniques to 
automatically generate the queries that enable a system to gather informa- 
tion from a set of sources [Beeri et al, 1997]. Given an initial specification of 
the query according to a common, global language, and a set of constraints 
expressing the relationship between the global schema and the individual 
sources where information is stored, the problem is to compute the queries 
to be posed to the local sources that provide answers, possibly approximate, 
to the original query [Calvanese et al., 2000a]. 

1.7 Relationship to other fields of Computer Science 

Description Logics were developed with the goals of providing formal, declar- 
ative meanings to semantic networks and frames, and of showing that such 
representation structures can be equipped with efficient reasoning tools. 
However, the underlying ideas of concept /class and hierarchical structure 
based upon the generality and specificity of a set of classes have appeared 
in many other fields of Computer Science, such as database management 
and programming languages. Consequently, there have been a number of 
attempts to find commonalities and differences among formalisms with sim- 
ilar underlying notions, but which were developed in different fields. More- 
over, by looking at the syntactic form of Description Logics - logics that 
are restricted to unary and binary predicates and allow restricted forms of 
quantification - other logical formalisms that have strong relationships with 
Description Logics have been identified. In this section we briefly address 
such relationships; in particular, we focus our attention on the relationship 
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of Description Logics to other class-based languages, and then we address 
the relationship between Description Logics and other logics. These topics 
are addressed in more detail in Chapter 4. 


1.7.1 Description Logics and other class-based formalisms 

As we have mentioned, Description Logics can, in principle, be related to 
other class-based formalisms. Before looking at other fields, it is worth re- 
lating Description Logics to other formalisms developed within the field of 
knowledge representation that share the intuitions underlying network-based 
representation structure. In [Lehmann, 1992] several languages aiming at 
structured representations of knowledge are reviewed. We have already dis- 
cussed the relationship between Description Logics and semantic networks 
and frames, since they provided the basic motivations for developing De- 
scription Logics in the first place. Among others, conceptual graphs [Sowa, 
1991] have been regarded as a way of representing conceptual structures 
very closely related to semantic networks (and consequently, to Descrip- 
tion Logics). However, only recently has there been a detailed analysis of 
the relationship between conceptual graphs and Description Logics [Baader 
et ai, 1999c]. The outcome of this work makes it apparent that, although 
one can establish a relationship between simple conceptual graphs and a 
DL language, there are substantial differences between the two formalisms. 
The most significant one is that Description Logics are characterized by 
the universally quantified role restriction, which is not present in conceptual 
graphs. Consequently, the interpretation of the representation structures be- 
comes substantially different. 

In many other fields of Computer Science we find formalisms for the repre- 
sentation of objects and classes [Motschnig-Pitrik and Mylopoulous, 1992], 
Such formalisms share the notion of a class that denotes a subset of the do- 
main of discourse, and they allow one to express several kinds of relationships 
and constraints (e.g., subclass constraints) that hold among classes. More- 
over, class-based formalisms aim at taking advantage of the class structure in 
order to provide various types of information, such as whether an element be- 
longs to a class, whether a class is a subclass of another class, and more gen- 
erally, whether a given constraint holds between two classes. In particular, 
formalisms that are built upon the notions of class and class-based hierar- 
chies have been developed in the field of database management, in semantic 
data modeling (see for example [Hull and King, 1987]), in object-oriented 
languages (see for example [Kim and Lochovsky, 1989]), and more generally, 
in programming languages (see for example [Lenzerini et ai, 1991]). 
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There have been several attempts to establish relationships among the 
class-based formalisms developed in different fields. In particular, the com- 
mon intuitions behind classes and concepts have stimulated several pieces 
of work aimed at establishing a precise relationship between class-based for- 
malisms and Description Logics. However, it is difficult to find a common 
framework for carrying out a precise comparison. 

In Chapter 4 a specific Description Logic is taken as a basis for identify- 
ing the common features of frame systems and object-oriented and semantic 
data models (see also [Calvanese et al., 1999e]). Specifically, a precise corre- 
spondence between the chosen DL and the Entity-Relationship model [Chen, 
1976], as well as with an object-oriented language in the style of [Abiteboul 
and Kanellakis, 1989], is presented there. 

This kind of comparison shows that one can indeed identify a large com- 
mon basis, but also that there are features that are currently missing in each 
formalism. For example, to capture semantic data models one needs a cyclic 
form of inclusion assertion, as well as the inverses of roles for modeling rela- 
tionships that work in both directions, while DL roles have a directionality 
from one concept to another. Moreover, in order to make a comparison with 
frame-based systems, one has to leave out both the non-monotonic features 
of frames, such as defaults and closures (which are addressed among the 
extensions of Description Logics in the previous section) and their dynamic 
aspects such as daemons and and triggers (with the exception of trigger rules, 
which are also addressed in the previous section). Finally, with respect to 
object-oriented data models the main difference is that although Description 
Logics provide the expressiveness to model record and set structures, they 
are not explicitly available in Description Logics and thus their representa- 
tion is a little cumbersome. On the other hand, semantic and object-oriented 
data models are typically not equipped with reasoning tools that are avail- 
able with Description Logics. This issue is further developed in Chapter 16, 
where the applications of Description Logics in the field of database man- 
agement are addressed. However, if the language is sufficiently expressive, 
as it needs to be in order to establish relationships among various class- 
based formalisms, one needs to distinguish between finite model reasoning, 
which is required for database languages that are designed to represent a 
closed domain of discourse, and unrestricted reasoning, which is typical of 
knowledge representation formalisms and, therefore, of Description Logics. 


1.7.2 Relationships to other logics 

The initial observation for addressing the relationship of Description Logics 
to other logics is the fact that Description Logics are subsets of first-order 
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logic. This has been known since the earliest days of Description Logics, and 
has been thoroughly investigated in [Borgida, 1996]. In fact, the Description 
Logic ACC corresponds to the fragment of first-order logic obtained by re- 
stricting the syntax to formulas containing two variables. The importance 
of this and subsequent studies on this issue is related to finding adequate 
characterizations of the expressiveness of Description Logics. 

Since Description Logics focus on a language formed by unary and binary 
predicates, it turned out that they are closely related to modal languages, 
if one regards roles as accessibility relations. In particular, Schild [1991] 
pointed out that some Description Logics are notational variants of cer- 
tain propositional modal logics; specifically, the Description Logic ACC has 
a modal logic counterpart, namely the multi-modal version of the logic K 
(see [Halpern and Moses, 1992]). Actually, A£C-concepts and formulas in 
multi-modal K can immediately be translated into each other. Moreover, 
an ACC - concept is satisfiable if and only if the corresponding K-formula is 
satisfiable. Research in the complexity of the satisfiability problem for modal 
propositional logics was initiated quite some time before the complexity of 
Description Logics was investigated. Consequently, this relationship made 
it possible to borrow from modal logic complexity results, reasoning tech- 
niques, and language constructs that had not been previously considered 
in Description Logics. On the other hand, there are features of Description 
Logics that did not have counterparts in modal logics and therefore needed 
ad hoc extensions of the reasoning techniques developed for modal logics. 
In particular, number restrictions as well as the treatment of individuals 
in the ABox required specific treatments based on the idea of reification, 
which amounts to expressing the extensions through a special kind of axiom 
within the logic. Finally, we mention that recent work has pointed out a 
relationship between Description Logics and guarded fragments, which can 
be regarded as generalizations of modal logics. Most of the research on very 
expressive Description Logics, addressed in Chapter 5, has its roots in the 
correspondence with modal logic. 


1.8 Conclusion 

From their humble origins in the late 1970s as a remedy for logical and se- 
mantic problems in frame and semantic network representations, Description 
Logics have grown to be a unique and important keystone in the history of 
knowledge representation. DL formalisms certainly evoked interest in their 
earliest days, with the invention and application of the Kl-One system, but 
international attention and research was given a significant boost in 1984 
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when Brachman and Levesque used the simple and intuitive structure of 
Description Logics as the basis for their observation about the tradeoff be- 
tween knowledge representation language expressiveness and computational 
complexity of reasoning. The way Description Logics were able to separate 
out the structure of concepts and roles into simple term-forming operators 
opened the door to extensive analysis of a broad family of languages. One 
could add and subtract these operators to and from the language and explore 
both the computational ramifications and the relationship of the resulting 
language to other formal languages in Computer Science, such as modal 
logics and data models for database systems. 

As a result, the family of Description Logic languages is probably the most 
thoroughly understood set of formalisms in all of knowledge representation. 
The computational space has been thoroughly mapped out, and a wide 
variety of systems have been built, testing out different styles of inference 
computation and being used in many applications. 

Description Logics are responsible for many of the cornerstone notions 
used in knowledge representation and reasoning. They helped crystallize 
many of the ideas treated informally in earlier notations, such as concepts 
and roles. But they added many new important building blocks for later 
work in the field: the terminology /assertion distinction (TBox/ABox), 
number and value restrictions on roles, internal structure for concepts, 
Tell&Ask interfaces, and others. They have been the subject of a great deal 
of comparison and analysis with their cousins in other fields of Computer 
Science, and DL systems run the gamut from simple, restricted systems with 
provably advantageous computational properties to extremely expressive 
systems that can support very powerful applications. Perhaps the most 
important aspect of work on Description Logics has been the very tight cou- 
pling between theory and practice. The exemplary give-and-take between 
the formal, analytical side of the field and the pragmatic, implemented side 
- notable throughout the entire history of Description Logics - has been a 
role model for other areas of AI. 
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Abstract 

This chapter provides an introduction to Description Logics as a formal 
language for representing knowledge and reasoning about it. It first gives 
a short overview of the ideas underlying Description Logics. Then it intro- 
duces syntax and semantics, covering the basic constructors that are used 
in systems or have been introduced in the literature, and the way these con- 
structors can be used to build knowledge bases. Finally, it defines the typical 
inference problems, shows how they are interrelated, and describes different 
approaches for effectively solving these problems. Some of the topics that 
are only briefly mentioned in this chapter will be treated in more detail in 
subsequent chapters. 


2.1 Introduction 

As sketched in the previous chapter, Description Logics is the most re- 
cent name 1 for a family of knowledge representation (KR) formalisms that 
represent the knowledge of an application domain (the “world”) by first 
defining the relevant concepts of the domain (its terminology), and then 
using these concepts to specify properties of objects and individuals oc- 
curring in the domain (the world description). As the name Description 
Logics indicates, one of the characteristics of these languages is that, un- 
like some of their predecessors, they are equipped with a formal, logic-based 
semantics. Another distinguished feature is the emphasis on reasoning as a 
central service: reasoning allows one to infer implicitly represented knowl- 
edge from the knowledge that is explicitly contained in the knowledge base. 

1 Previously used names are terminological knowledge representation languages, concept lan- 
guages, term subsumption languages, and KL-ONE-based knowledge representation languages. 
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Description Logics support inference patterns that occur in many appli- 
cations of intelligent information processing systems, and which are also 
used by humans to structure and understand the world: classification of 
concepts and individuals. Classification of concepts determines subconcept- 
superconcept relationships (called subsumption relationships in Description 
Logics) between the concepts of a given terminology, and thus allows one 
to structure the terminology in the form of a subsumption hierarchy. This 
hierarchy provides useful information on the connection between different 
concepts, and it can be used to speed up other inference services. Classi- 
fication of individuals (or objects) determines whether a given individual 
is always an instance of a certain concept (i.e., whether this instance rela- 
tionship is implied by the description of the individual and the definition 
of the concept). It thus provides useful information on the properties of an 
individual. Moreover, instance relationships may trigger the application of 
rules that insert additional facts into the knowledge base. 

Because Description Logics are a KR formalism, and since in KR one 
usually assumes that a KR system should always answer the queries of a user 
in reasonable time, the reasoning procedures DL researchers are interested in 
are decision procedures, i.e., unlike first-order theorem provers, for example, 
these procedures should always terminate, both for positive and for negative 
answers. Since the guarantee of an answer in finite time need not imply 
that the answer is given in reasonable time, investigating the computational 
complexity of a given Description Logic with decidable inference problems 
is an important issue. Decidability and complexity of the inference problems 
depend on the expressive power of the Description Logic at hand. On the 
one hand, very expressive Description Logics are likely to have inference 
problems of high complexity, or they may even be undecidable. On the other 
hand, very weak Description Logics (with efficient reasoning procedures) 
may not be sufficiently expressive to represent the important concepts of a 
given application. As mentioned in the previous chapter, investigating this 
tradeoff between the expressivity of Description Logics and the complexity 
of their reasoning problems has been one of the most important issues in 
DL research. 

Description Logics are descended from so-called “structured inheritance 
networks” [Brachman, 1977b, 1978], which were introduced to overcome the 
ambiguities of early semantic networks and frames, and which were first re- 
alized in the system Kl-One [Brachman and Schmolze, 1985]. The following 
three ideas, first put forward in Brachman’s work on structured inheritance 
networks, have largely shaped the subsequent development of Description 
Logics: 
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• The basic syntactic building blocks are atomic concepts (unary predicates), 
atomic roles (binary predicates), and individuals (constants). 

• The expressive power of the language is restricted in that it uses a rather small set 
of (epistemologically adequate) constructors for building complex concepts and 
roles. 

• Implicit knowledge about concepts and individuals can be inferred automatically 
with the help of inference procedures. In particular, subsumption relationships 
between concepts and instance relationships between individuals and concepts 
play an important role: unlike IS-A links in semantic networks, which are explicitly 
introduced by the user, subsumption relationships and instance relationships are 
inferred from the definition of the concepts and the properties of the individuals. 

After the first logic-based semantics for I\L-ONE-like KR languages were 
proposed, the inference problems like subsumption could also be pro- 
vided with a precise meaning, which led to the first formal investiga- 
tions of the computational properties of such languages. It has turned out 
that the languages used in early DL systems were too expressive, which 
led to undecidability of the subsumption problem [Schmidt-Schaufi, 1989; 
Patel-Schneider, 1989b]. The first worst-case complexity results [Levesque 
and Brachman, 1987; Nebel, 1988] showed that the subsumption problem is 
intractable (i.e., not polynomially solvable) even for very inexpressive lan- 
guages. As mentioned in the previous chapter, this work was the starting 
point of a thorough investigation of the worst-case complexity of reasoning 
in KL-ONE-like KR languages (see Chapter 3 for details). 

Later on it has turned out, however, that intractability of reasoning (in 
the sense of being non-polynomial in the worst case) does not prevent a 
Description Logic from being useful in practice, provided that sophisticated 
optimization techniques are used when implementing a system based on such 
a Description Logic (see Chapter 9). When implementing a DL system, the 
efficient implementation of the basic reasoning algorithms is not the only 
issue, though. On the one hand, the derived system services (such as clas- 
sification, i.e., constructing the subsumption hierarchy between all concepts 
defined in a terminology) must be optimized as well [Baader et al, 1994], 
On the other hand, one needs a good user and application programming 
interface (see Chapter 7 for more details). Most implemented DL systems 
provide for a rule language, which can be seen as a very simple, but effective, 
application programming mechanism (see Subsection 2.2.5 for details). 

Section 2.2 introduces the basic formalism of Description Logics. By way 
of a prototypical example, it first introduces the formalism for describing 
concepts (i.e., the description language), and then defines the terminological 
(TBox) and the assertional (ABox) formalisms. Next, it introduces the basic 
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Fig. 2.1. Architecture of a knowledge representation system based on Description 
Logics. 

reasoning problems and shows how they are related to each other. Finally, 
it defines the rule language that is available in many of the implemented DL 
systems. 

Section 2.3 describes algorithms for solving the basic reasoning problems 
in Description Logics. After shortly sketching structural subsumption algo- 
rithms, it concentrates on tableau-based algorithms. Finally, it comments 
on the problem of reasoning w.r.t. terminologies. 

Finally, Section 2.4 describes some additional language constructors that 
are not included in the prototypical family of description languages intro- 
duced in Section 2.2, but have been considered in the literature and are 
available in some DL systems. 


2.2 Definition of the basic formalism 

A KR system based on Description Logics provides facilities to set up knowl- 
edge bases, to reason about their content, and to manipulate them. Fig- 
ure 2.1 sketches the architecture of such a system (see Chapter 8 for more 
information on DL systems). 

A knowledge base (KB) comprises two components, the TBox and the 
ABox. The TBox introduces the terminology , i.e. , the vocabulary of an appli- 
cation domain, while the ABox contains assertions about named individuals 
in terms of this vocabulary. 

The vocabulary consists of concepts , which denote sets of individuals, and 
roles, which denote binary relationships between individuals. In addition to 
atomic concepts and roles (concept and role names), all DL systems allow 
their users to build complex descriptions of concepts and roles. The TBox can 
be used to assign names to complex descriptions. The language for building 
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descriptions is a characteristic of each DL system, and different systems are 
distinguished by their description languages. The description language has a 
model-theoretic semantics. Thus, statements in the TBox and in the ABox 
can be identified with formulae in first-order logic or, in some cases, a slight 
extension of it. 

A DL system not only stores terminologies and assertions, but also offers 
services that reason about them. Typical reasoning tasks for a terminology 
are to determine whether a description is satisfiable (i.e., non-contradictory), 
or whether one description is more general than another one, that is, whether 
the first subsumes the second. Important problems for an ABox are to find 
out whether its set of assertions is consistent, that is, whether it has a model, 
and whether the assertions in the ABox entail that a particular individual 
is an instance of a given concept description. Satisfiability checks of descrip- 
tions and consistency checks of sets of assertions are useful to determine 
whether a knowledge base is meaningful at all. With subsumption tests, one 
can organize the concepts of a terminology into a hierarchy according to 
their generality. A concept description can also be conceived as a query, de- 
scribing a set of objects one is interested in. Thus, with instance tests, one 
can retrieve the individuals that satisfy the query. 

In any application, a KR system is embedded into a larger environment. 
Other components interact with the KR component by querying the knowl- 
edge base and by modifying it, that is, by adding and retracting concepts, 
roles, and assertions. A restricted mechanism for adding assertions uses rules. 
Rules are an extension of the logical core formalism, which can still be inter- 
preted logically. However, many systems, in addition to providing an appli- 
cation programming interface that consists of functions with a well-defined 
logical semantics, provide an escape hatch by which application programs 
can operate on the KB in arbitrary ways. 


2.2.1 Description languages 

We assume there are two kinds of symbols, which we call atomic concepts 
and atomic roles. In abstract notation, we use the letters A and B for atomic 
concepts and the letter R for atomic roles. We refer to atomic concepts and 
roles as atomic symbols. Atomic concepts and roles are elementary descrip- 
tions from which we inductively build complex descriptions by means of 
concept constructors and role constructors. In this chapter, we will mostly 
restrict the attention to concept constructors. Role constructors are intro- 
duced in Subsection 2.4.1. We use the letters C and D for arbitrary concept 
descriptions. 



52 


F. Baader and W. Nutt 


Description languages are distinguished by the constructors they provide. 
In the sequel we shall discuss various languages from the family of Ad- 
languages. The language Ad (= attributive language) has been introduced 
in [Schmidt-Schaufi and Srnolka, 1991] as a minimal language that is of 
practical interest. The other languages of this family are extensions of Ad. 

2.2. 1.1 The basic description language Ad 
Concept descriptions in Ad are formed according to the following syntax 
rule: 


A | 

(atomic concept) 

T | 

(universal concept) 

± 

(bottom concept) 

A | 

(atomic negation) 

Cnd 

(intersection) 

VR.C | 

(value restriction) 

3R.T 

(limited existential quantification) 


Note that, in Ad, negation can only be applied to atomic concepts, and only 
the top concept is allowed in the scope of an existential quantification over a 
role. For historical reasons, the sublanguage of Ad obtained by disallowing 
atomic negation is called T d~ and the sublanguage of lFd~ obtained by 
disallowing limited existential quantification is called T d§. 

To give examples of what can be expressed in Ad, we suppose that 
Person and Female are atomic concepts. Then Person n Female and Person n 
-■Female are „4£-concepts describing, intuitively, those persons that are 
female, and those that are not female. If, in addition, we suppose that 
hasChild is an atomic role, we can form the concepts Person ndhasChild.T 
and Person n VhasChild. Female, denoting those persons that have a child, 
and those persons all of whose children are female. Using the bottom con- 
cept, we can also describe those persons without a child by the concept 
Person n VhasChild. _L. 

In order to define a formal semantics of ,4£-concepts, we consider in- 
terpretations T that consist of a non-empty set A x (the domain of the in- 
terpretation) and an interpretation function, which assigns to every atomic 
concept A a set A 1 C A 1 and to every atomic role R a binary relation 
R 1 C A 1 x A x . The interpretation function is extended to concept descrip- 
tions by the following inductive definitions: 

T 1 = A 1 
A 1 = 0 
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(-. A f = A x \ A x 
(C n D) x = c 1 n D 1 

(' MR.C ) x = {oeA 1 Mb. (a, b) G R 1 -»■ b G C 1 } 

( 3R.T f = {a G A 1 \ 3b. (a,b) G R 1 }. 

We say that two concepts C , D are equivalent , and write C = D, if 
C x = D x for all interpretations X. For instance, going back to the defini- 
tion of the semantics of concepts, one easily verifies that VhasChild. Female n 
VhasChild. Student and VhasChild. (Female n Student) are equivalent. 

2.2. 1.2 The family of All-languages 

We obtain more expressive languages if we add further constructors to AC. 
The union of concepts (indicated by the letter U) is written as C U D, and 
interpreted as 

{C U D) 1 = C T U D x . 

Full existential quantification (indicated by the letter S) is written as 
3 R.C, and interpreted as 

(3 R.C) X = {a G A x | 3b. (a, b) € R x A b e C 1 }. 

Note that 3 R.C differs from 3R.T in that arbitrary concepts are allowed to 
occur in the scope of the existential quantifier. 

Number restrictions (indicated by the letter A f) are written as ^ nR (at- 
least restriction) and as ^ n R (at-most restriction) , where n ranges over the 
nonnegative integers. They are interpreted as 

(^ nR) 1 = |a € A x \{b \ (a, b) G R X }\ > n|, 

and 

(^ nR) 1 = |a G A x \{b \ (a, b) G R X }\ < n|, 

respectively, where “| • |” denotes the cardinality of a set. From a seman- 
tic viewpoint, the coding of numbers in number restrictions is immaterial. 
However, for the complexity analysis of inferences it can matter whether a 
number n is represented in binary (or decimal) notation or by a string of 
length n, since binary (decimal) notation allows a more compact represen- 
tation. 

The negation of arbitrary concepts (indicated by the letter C, for “com- 
plement”) is written as -> C, and interpreted as 

(-. C) 1 = A X \ C x . 
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With the additional constructors, we can, for example, describe those 
persons that have either not more than one child or at least three children, 
one of whom is female: 

Person n 1 hasChild U 3 hasChild n dhasChild. Female)). 

Extending AC by any subset of the above constructors yields a particular 
„4£-language. We name each „4£-language by a string of the form 

AC[U][£][Af\[C], 

where a letter in the name stands for the presence of the corresponding 
constructor. For instance, AC£M is the extension of AC by full existential 
quantification and number restrictions (see the appendix on DL terminol- 
ogy for how to extend this naming scheme to more expressive Description 
Logics). 

From the semantic point of view, not all these languages are distinct, how- 
ever. The semantics enforces the equivalences C U D = — i(-i C n ~>D) and 
3R.C = —itiR.—iC. Hence, union and full existential quantification can be 
expressed using negation. Conversely, the combination of union and full 
existential quantification gives us the ability to express negation of con- 
cepts (through their equivalent negation normal form, see Subsection 2.3.2). 
Therefore, we assume w.l.o.g. that union and full existential quantification 
are available in every language that contains negation, and vice versa. It 
follows that (modulo the equivalences mentioned above), all „4£-languages 
can be written using the letters U, £, M only. It is not hard to see that 
the eight languages obtained this way are indeed pairwise non-equivalent. 
In the sequel, we shall not distinguish between an „4£-language with nega- 
tion and its counterpart that has union and full existential quantification 
instead. In the same vein, we shall use the letter C instead of the letters IA£ 
in language names. For instance, we shall write ACC instead of ACU£ and 
ACCM instead of ACU£Af . 

2.2. 1.3 Description languages as fragments of predicate logic 
The semantics of concepts identifies description languages as fragments of 
first-order predicate logic. Since an interpretation X respectively assigns to 
every atomic concept and role a unary and binary relation over A . 1 , we 
can view atomic concepts and roles as unary and binary predicates. Then, 
any concept C can be translated effectively into a predicate logic formula 
4>c( x ) with one free variable x such that for every interpretation X the set 
of elements of A x satisfying <fc( x ) is exactly C x : an atomic concept A is 
translated into the formula A{x)\ the constructors intersection, union, and 
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negation are translated into logical conjunction, disjunction, and negation, 
respectively; if C is already translated into (j>c(x) and R is an atomic role, 
then value restriction and existential quantification are captured by the for- 
mulae 


</>3 R.c(v) = 3x - x ) A 0c(x) 

( k/R.c(y ) = Vx. R(y, x) -» 4>c (x), 

where y is a new variable; number restrictions are expressed by the formulae 


r(%) 

= 3yi,.. 

. , y n . R(x, yi) A ■ ■ ■ A R(x, y n ) A f\yi^ Vj 



i<j 

R{p^) 

= Vyi,.. 

. , y n+ 1 . R(x, yi) A ■ ■ ■ A R(x, y n+1 ) \J Vi = Vj 


i<j 


Note that the equality predicate “=” is needed to express number restric- 
tions, while concepts without number restrictions can be translated into 
equality- free formulae. 

One may argue that, since concepts can be translated into predicate logic, 
there is no need for a special syntax. However, the above translations show 
that, in particular for number restrictions, the variable-free syntax of De- 
scription Logics is much more concise. As can be seen from Section 2.3, it 
also lends itself easily to the development of algorithms. 

A more detailed analysis of the connection between fragments of first- 
order predicate logic and Description Logics can be found in Chapter 4. 


2.2.2 Terminologies 

We have seen how we can form complex descriptions of concepts to describe 
classes of objects. Now, we introduce terminological axioms, which make 
statements about how concepts or roles are related to each other. Then we 
single out definitions as specific axioms and identify terminologies as sets 
of definitions by which we can introduce atomic concepts as abbreviations 
or names for complex concepts. If the definitions in a terminology contain 
cycles, we may have to adopt fixpoint semantics to make them unequivocal. 
We discuss for which types of terminologies fixpoint models exist. 

2.2.2. 1 Terminological axioms 

In the most general case, terminological axioms have the form 
C\ZD (R C S) or C = D ( R = S ), 
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Woman = 
Man = 
Mother = 
Father = 
Parent = 
Grandmother = 
MotherWithManyChildren = 
MotherWithoutDaughter = 
Wife = 


Person n Female 
Person n -iWoman 
Woman n BhasChild. Person 
Man n ElhasChild. Person 
Father U Mother 
Mother n BhasChild. Parent 
Mother n ^ 3 hasChild 
Mother n VhasChild. -iWoman 
Woman n BhasHusband.Man 


Fig. 2.2. A terminology (TBox) with concepts about family relationships. 


where C, D are concepts (and R, S are roles). Axioms of the first kind are 
called inclusions , while axioms of the second kind are called equalities. To 
simplify the exposition, we deal in the following only with axioms involving 
concepts. 

The semantics of axioms is defined as one would expect. An interpretation 
X satisfies an inclusion C C D if C x C D x , and it satisfies an equality C = D 
if C x = D 1 . If X is a set of axioms, then X satisfies X iff X satisfies each 
element of X. If X satisfies an axiom (resp. a set of axioms), then we say 
that it is a model of this axiom (resp. set of axioms). Two axioms or two 
sets of axioms are equivalent if they have the same models. 


2. 2.2.2 Definitions 

An equality whose left-hand side is an atomic concept is a definition. Defi- 
nitions are used to introduce symbolic names for complex descriptions. For 
instance, by the axiom 

Mother = Woman n ElhasChild. Person 

we associate to the description on the right-hand side the name Mother. 
Symbolic names may be used as abbreviations in other descriptions. If, for 
example, we have defined Father analogously to Mother, we can define Parent 
as 


Parent = Mother LI Father. 

A set of definitions should be unequivocal. We call a finite set of definitions 
X a terminology or TBox if no symbolic name is defined more than once, 
that is, if for every atomic concept A there is at most one axiom in X whose 
left-hand side is A. Figure 2.2 shows a terminology with concepts concerned 
with family relationships. 

Suppose X is a terminology. We divide the atomic symbols occurring in X 
into two sets, the name symbols Nt that occur on the left-hand side of some 
axiom and the base symbols Bt that occur only on the right-hand side of 
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axioms. Name symbols are often called defined concepts and base symbols 
primitive concepts and primitive roles. 2 We expect that the terminology 
defines the name symbols in terms of the base symbols, which now we make 
more precise. 

A base interpretation for T is an interpretation that interprets only the 
base symbols. Let J be such a base interpretation. An interpretation X that 
interprets also the name symbols is an extension of J if it has the same 
domain as J . i.e., A 1 * = A^, and if it agrees with J for the base symbols. 
We say that T is definitorial if every base interpretation has exactly one 
extension that is a model of T . In other words, if we know what the base 
symbols stand for, and T is definitorial, then the meaning of the name 
symbols is completely determined. Obviously, if a terminology is definitorial, 
then every equivalent terminology is also definitorial. 

The question whether a terminology is definitorial or not is related to the 
question whether or not its definitions are cyclic. For instance, the termi- 
nology that consists of the the single axiom 

Human 7 = Animal n VhasParent. Human 7 (2-1) 

contains a cycle because, intuitively, Human 7 is defined in terms of Human 7 . 
We now give a technical definition of cycles in terminologies. Let A, B be 
atomic concepts occurring in T. We say that A directly uses B in T if B 
appears on the right-hand side of the definition of A, and we define uses 
to be the transitive closure of the relation directly uses. Then T contains a 
cycle iff there exists an atomic concept in T that uses itself. Otherwise, T 
is called acyclic. 

Unique extensions need not exist if a terminology contains cycles. Con- 
sider, for instance, the terminology that contains only Axiom (2.1). Here, 
Human 7 is a name symbol and Animal and hasParent are base symbols. For 
an interpretation where hasParent relates every animal to its progenitors, 
many extensions are possible to interpret Human 7 in a such a way that the 
axiom is satisfied: Human 7 can, among others, be interpreted as the set of 
all animals, as some species, or any other set of animals with the property 
that for each animal it contains also its progenitors. 

In contrast, if a terminology T is acyclic, then it is definitorial. The reason 
is that we can expand through an iterative process the definitions in T by 
replacing each occurrence of a name on the right-hand side of a definition by 
the concepts that it stands for. Since there is no cycle in the set of definitions, 

2 Note that some papers use the notion “primitive concept” with a different meaning; e.g., syn- 

onymous with what we call atomic concepts, or to denote the (atomic) left-hand sides of concept 

inclusions. 
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Woman = 

Person n Female 

Man = 

Person n -^(Person n Female) 

Mother = 

(Person n Female) n BhasChild. Person 

Father = 

(Person n -i(Person n Female)) n ElhasChild. Person 

Parent = 

((Person n ->(Person n Female)) n ElhasChild. Person) 
U ((Person n Female) n 3hasChild. Person) 

Grandmother = 

((Person n Female) n 3hasChild. Person) 
n 3hasChild. (((Person n -i(Person n Female)) 
n 3hasChild. Person) 

U ((Person n Female) 
n BhasChild. Person)) 

MotherWithManyChildren = 

((Person n Female) n 3hasChild. Person) n ^ 3 hasChild 

MotherWithoutDaughter = 

((Person n Female) n 3hasChild. Person) 
n VhasChild.(-i(Person n Female)) 

Wife = 

(Person n Female) 

fl 3hasHusband. (Person n ^(Person n Female)) 


Fig. 2.3. The expansion of the Family TBox in Figure 2.2. 


the process eventually stops and we end up with a terminology T' consisting 
solely of definitions of the form A = C' , where C' contains only base symbols 
and no name symbols. We call T' the expansion of T . Note that the size 
of the expansion can be exponential in the size of the original terminology 
[Nebel, 1990b]. The Family TBox in Figure 2.2 is acyclic. Therefore, we can 
compute the expansion, which is shown in Figure 2.3. 

Proposition 2.1 LetT be an acyclic terminology and T be its expansion. 
Then: 

1. T and T' have the same name and base symbols; 

2. T and T' are equivalent; 

3. both T and T are definitorial. 


Proof Let T± be a terminology. Suppose A = C and B = D are definitions 
in 7i such that B occurs in C. Let C be the concept obtained from C by 
replacing each occurrence of B in C by D, and let 72 be the terminology 
obtained from by replacing the definition A = C by A = C' . Then both 
terminologies have the same name and base symbols. Moreover, since T 2 has 
been obtained from T\ by replacing equals by equals, both terminologies have 
the same models. Since T is obtained from T by a sequence of replacement 
steps like the ones above, this proves claims 1 and 2. 

Suppose now that J is an interpretation of the base symbols. We extend 
it to an interpretation T that covers also the name symbols by setting A 1 = 
C' J , if A = C' is the definition of A in T 1 . Clearly, Z is a model of T' . 
and it is the only extension of J that is a model of T' . This shows that 
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T' is definitorial. Moreover, T is definitorial as well, since it is equivalent 
to T . □ 

It is characteristic of acyclic terminologies, in a sense to be made more 
precise, to uniquely define the name symbols in terms of the base symbols. 

Of course, there are also terminologies with cycles that are definitorial. 
Consider for instance the one consisting of the axiom 

A = VR.BU3R.(An^A), (2.2) 

which has a cycle. However, since 3R.(A n —'A) is equivalent to the bottom 
concept, Axiom (2.2) is equivalent to the acyclic axiom 

A = VR.B. (2.3) 

This example is typical of the general situation. 

Theorem 2.2 Every definitorial ACC -terminology is equivalent to an 
acyclic terminology. 

The theorem is a reformulation of Beth’s Definability Theorem [Gabbay, 
1972] for the modal propositional logic K n , which, as shown by Schild [1991], 
is a notational variant of ACC. 

2. 2.2. 3 Fixpoint semantics for terminological cycles 
Under the semantics we have studied so far, which is essentially the seman- 
tics of first-order logic, terminologies have definitorial effect only if they are 
essentially acyclic. Following Nebel [1991], we shall call this semantics de- 
scriptive semantics to distinguish it from the fixpoint semantics introduced 
below. Fixpoint semantics are motivated by the fact that there are situations 
where intuitively cyclic definitions are meaningful and the intuition can be 
captured by least or greatest fixpoint semantics. 

Example 2.3 Suppose that we want to specify the concept of a “man who 
has only male descendants”, for short Momd. In particular, such a man is 
a Mos, that is, a “man who has only sons”. A Mos can be defined without 
cycles as 

Mos = Man n VhasChild.Man. 

For a Momd, however, we want to make a statement about the fillers of the 
transitive closure of the role hasChild. Here a recursive definition of Momd 
seems to be natural. A man having only male descendants is himself a man, 
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and all his children are men having only male descendants: 

Momd = Man n VhasChild.Momd. (2-4) 

In order to achieve the desired meaning, we have to interpret this definition 
under an appropriate fixpoint semantics. We shall show below that greatest 
fixpoint semantics captures our intuition here. ■ 

Cycles also appear when we want to model recursive structures, e.g., bi- 
nary trees . 3 

Example 2.4 We suppose that there is a set of objects that are Trees and 
a binary relation has-branch between objects that leads from a tree to its 
subtrees. Then the binary trees are the trees with at most two subtrees that 
are themselves binary trees: 

BinaryTree = Tree n ^ 2 has-branch n Vhas-branch.BinaryTree. 

As with the definition of Momo, a fixpoint semantics will yield the de- 
sired meaning. However, for this example we have to use least fixpoint 
semantics. ■ 

We now give a formal definition of fixpoint semantics. In a terminology 
X, every name symbol A occurs exactly once as the left-hand side of an 
axiom A = C. Therefore, we can view T as a mapping that associates to a 
name symbol A the concept description T (A) = C. With this notation, an 
interpretation X is a model of T if, and only if, A 1 = (T (A)) x . This charac- 
terization has the flavor of a fixpoint equation. We exploit this similarity to 
introduce a family of mappings such that an interpretation is a model of T 
iff it is a fixpoint of such a mapping. 

Let X be a terminology, and let J be a fixed base interpretation of X. 
By Extj we denote the set of all extensions of J . Let Tj\ Extj — > Extj be 
the mapping that maps the extension X to the extension Tj(I) defined by 
= (T(A)) 1 for each name symbol A. 

Now, X is a fixpoint of Tj iff X = Tj(2), i.e. , iff A 1 = A Tj ^ for all name 
symbols. This means that, for every definition A = C in X, we have A x = 
4 X 7 X) = (2(A)) 1 = C 1 , which means that X is a model of X. This proves 
the following result. 

Proposition 2.5 Let T be a terminology, X be an interpretation, and J be 
the restriction of 2 to the base symbols of T . Then X is a model ofT if, and 

3 The following example is taken from [Nebel, 1991]. 



Basic Description Logics 


61 


only if, X is a fixpoint of Zj . 

According to the preceding proposition, a terminology Z is definitorial iff 
every base interpretation J has a unique extension that is a fixpoint of Zj. 

Example 2.6 To get a feel for why cyclic terminologies are not definito- 
rial, we discuss as an example the terminology X Momd that consists only of 
Axiom (2.4). Consider the base interpretation J defined by 

A" 7 = { Charles ] , CharleS' 2 , . . .} U { Jamesi , . . . , James Last} 

Mam 7 = A " 7 

hasChild " 7 = {( Chariest , Charlesn^) | i > 1} U 

{(Jamesi, JameS(j +1 )) | 1 < i < Last}. 

This means that the Charles dynasty does not die out, whereas there is a 
last member of the James dynasty. 

We want to identify the fixpoints of 7"^ /lomd . Note that an individual with- 
out children, i.e., without fillers of hasChild, is always in the interpretation 
of VhasChild.Momd, no matter how Momd is interpreted. Therefore, if X is a 
fixpoint extension of J . then JamesLast is in (VhasChild.Momd) 7 , and thus 
in Momd 7 . We conclude that every James is a Momd. Let Z\ be the exten- 
sion of J such that Momd 71 comprises exactly the James dynasty. Then it is 
easy to check that Z\ is a fixpoint. If, in addition to the James dynasty, some 
Charles is a Momd, then all the members of the Charles dynasty before and 
after him must belong to the concept Momd. One can easily check that the 
extension X 2 that interprets Momd as the entire domain is also a fixpoint, 
and that there is no other fixpoint. ■ 

In order to give definitorial effect to a cyclic terminology X, we must single 
out a particular fixpoint of the mapping Tj if there are more than one. To 
this end, we define a partial ordering “A” on the extensions of J . We say 
that TAX 7 if A 7 C A 1 ' for every name symbol in X. In the above example, 
Momd is the only name symbol. Since Momd 71 C Momd 72 , we have Z\ A X 2 . 

A fixpoint X of Tj is the least fixpoint (lfp) if X A Z l for every other fix- 
point X 7 . We say that X is a least fixpoint model of X if X is the least fixpoint 
of Tj for some base interpretation J . Under least fixpoint semantics we only 
admit the least fixpoint models of X as intended interpretations. Greatest 
fixpoints (gfp), greatest fixpoint models, and greatest fixpoint semantics are 
defined analogously. In the Momd example, Z\ is the least and X 2 the greatest 
fixpoint of Zj . 
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2.2.2. 4 Existence of fixpoint models 

Least and greatest fixpoint models need not exist for every terminology. 

Example 2.7 As a simple example, consider the axiom 

A = ->A. (2.5) 

If X is a model of this axiom, then A 1 = A x \ A 1 , which implies A x = 0, an 
absurdity. 

A terminology containing Axiom (2.5) thus does not have any models, 
and therefore also no gfp (lfp) models. 

There are also cases where models (i.e., fixpoints) exist, but there is neither 
a least one nor a greatest one. As an example, consider the terminology T 
with the single axiom 

A = \/R.-iA. (2.6) 

Let J be the base interpretation with A ^ = {a, b} and = {(a, b), (6, a)}. 
Then there are two fixpoint extensions Zi, Z 2 , defined by A 11 = {a} and 
A 12 = {6}. However, they are not comparable with respect to “A”. ■ 

In order to identify terminologies with the property that for every base 
interpretation there exists a least and a greatest fixpoint extension, we draw 
upon results from lattice theory. Recall that a lattice is complete if every 
family of elements has a least upper bound. 

On Extj we have introduced the partial ordering “A”. For a family of 
interpretations (Zj)j e j in Extj we define Xo = |_| as the pointwise union 
of the ZjS, that is, for every name symbol A we have A x ° = (Jig/ A 1 ' . Then 
Zo is the least upper bound of the X,s, which shows that (Extj, A) is a 
complete lattice. 

A function /: L — » L on a lattice (X, A) is monotone if f(x) A f(y) when- 
ever x Ay. Tarski’s Fixpoint Theorem [Tarski, 1955] says that for a mono- 
tone function on a complete lattice the set of fixpoints is nonempty and 
itself forms a complete lattice. In particular, there is a least and a greatest 
fixpoint. 

We define that a terminology X is monotone if the mapping Tj is mono- 
tone for all base interpretations J . By Tarski’s theorem, such terminologies 
have greatest and least fixpoints. 

Proposition 2.8 If T is a monotone terminology and J a base interpreta- 
tion, then there exist extensions of J that are an Ifp-model and a gfp-model 
of X, respectively. 
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To find a syntactic criterion for monotonicity, we consider the operators 
of the AC-languages that are applied to concepts, namely, “n” , “U” , “-i” , 
“V”, and “3”. 

These operators have as their natural counterparts operations on the sub- 
sets of some domain, say A. Clearly, intersection and union correspond to 
the set operations with the same names, while negation corresponds to com- 
plement with respect to the domain. Since “V” and “3” are applied to a role 
and a concept, we introduce, for every binary relation r on A, the unary 
operation A r that maps a set S C A to 

A r (S) = {a e A | V6. (a, b) € r -»• b e S} 

and the operation E r that maps a set S' C A to 

E r (S) = {a € A | 36. (a, b) G r A b € S}. 

Then we see that all these operations, with the sole exception of com- 
plement, are monotone in the sense that if they are applied to larger ar- 
guments, then the results are larger, too. This is obvious for intersection 
and union. Also, one readily checks that S C S' implies A r (S) C A r (S') and 
E r (S) C E r (S / ). Obviously, the analogous statement for negation does not 
hold. 

We call a terminology negation- free if no negation sign occurs in it. Based 
on the above observations we can prove by induction a sufficient syntactic 
condition for an ATCAAterminology to be monotone. 

Proposition 2.9 Every negation- free ACC J\f -terminology is monotone. 

We can generalize this criterion by analyzing which negation signs in a 
terminology are potentially harmful for monotonicity. Clearly, a negation 
sign can only affect the monotonicity of a terminology if in its scope there 
is a name symbol. Moreover, intuitively, a negation sign is neutralized if it 
is in the scope of another negation. This motivates the following definition. 
We say a concept C is syntactically monotone if each name symbol in C 
occurs in the scope of an even number of negation signs. A terminology is 
syntactically monotone if every concept on the right-hand side of an equality 
is syntactically monotone. 

Proposition 2.10 Every syntactically monotone ACCAf -terminology is 
monotone. 



64 


F. Baader and W. Nutt 


2. 2. 2. 5 Terminologies with inclusion axioms 
For certain concepts we may be unable to define them completely. In this 
case, we can still state necessary conditions for concept membership using an 
inclusion. We call an inclusion whose left-hand side is atomic a specialization. 

For example, if a (male) knowledge engineer thinks that the definition of 
“woman” in our example TBox (Figure 2.2) is not satisfactory, but if he also 
feels that he is not able to define the concept “woman” in all detail, he can 
require that every woman is a person with the specialization 

Woman C Person. (2.7) 

If we also allow specializations in a terminology, then the terminology loses 
its definitorial effect, even if it is acyclic. A set of axioms T is a generalized 
terminology if the left-hand side of each axiom is an atomic concept and 
for every atomic concept there is at most one axiom where it occurs on the 
left-hand side. 

We shall transform a generalized terminology T into a regular terminol- 
ogy X, containing definitions only, such that T is equivalent to T in a sense 
that will be specified below. We obtain T from T by choosing for every 
specialization A C C in T a new base symbol A and by replacing the spe- 
cialization ACC with the definition A = A n C. The terminology T is the 
normalization of T . 

If a TBox contains the specialization (2.7), then the normalization con- 
tains the definition 

Woman = Woman n Person. 

Intuitively, the additional base symbol Woman stands for the qualities that 
distinguish a woman among persons. Thus, normalization results in a TBox 
with a definition for Woman that is similar to the one in the Family TBox. 

Proposition 2.11 Let T be a generalized terminology and T its normal- 
ization. 

• Every model ofT is a model of T . 

• For every model I of X there is a model X of X that has the same domain as I 
and agrees with X on the atomic concepts and roles in T . 

Proof The first claim holds because a model X of X satisfies A x = (A n C) x 
= A 1 D C 1 , which implies A^ C CX Conversely, if X is a model of X, then 
the extension X of X, defined by A X = A 1 , is a model of X, because A 1 C C x 
implies A 1 = A 1 fi C 1 = A 1 D C^, and therefore X satisfies A = A n C. □ 
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MotherWithoutDa Lighter) MARY) Father( PETER) 

hasChild(MARY, PETER) hasChild(PETER. HARRY) 

hasChild(MARY, PAUL) 

Fig. 2.4. A world description (ABox). 

Thus, in theory, inclusion axioms do not add to the expressivity of ter- 
minologies. However, in practice, they are a convenient means to introduce 
terms into a terminology that cannot be defined completely. 


2.2.3 World descriptions 

The second component of a knowledge base, in addition to the terminology 
or TBox, is the world description or ABox. 

2.2.3. 1 Assertions about individuals 

In the ABox, one describes a specific state of affairs of an application do- 
main in terms of concepts and roles. Some of the concept and role atoms in 
the ABox may be defined names of the TBox. In the ABox, one introduces 
individuals, by giving them names, and one asserts properties of these indi- 
viduals. We denote individual names by a, b, c. Using concepts C and roles 
R, one can make assertions of the following two kinds in an ABox: 

C{a), R(b,c). 

By the first kind, called concept assertions, one states that a belongs to (the 
interpretation of) C. By the second kind, called role assertions, one states 
that c is a filler of the role R for b. For instance, if PETER, PAUL, and MARY 
are individual names, then Father(PETER) means that Peter is a father, and 
hasChild(MARY, PAUL) means that Paul is a child of Mary. An ABox, denoted 
by A, is a finite set of such assertions. Figure 2.4 shows an example of an 
ABox. 

In a simplified view, an ABox can be seen as an instance of a relational 
database with only unary or binary relations. However, contrary to the 
“closed-world semantics” of classical databases, the semantics of ABoxes 
is an “open- world semantics” , since normally knowledge representation sys- 
tems are applied in situations where one cannot assume that the knowledge 
in the KB is complete. 4 Moreover, the TBox imposes semantic relationships 
between the concepts and roles in the ABox that do not have counterparts 
in database semantics. 

4 We discuss implications of this difference in semantics in Subsection 2. 2. 4. 4. 
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We give a semantics to ABoxes by extending interpretations to individual 
names. From now on, an interpretation Z = (A 1 , - x ) not only maps atomic 
concepts and roles to sets and relations, but in addition maps each individual 
name a to an element afi £ A 1 . We assume that distinct individual names 
denote distinct objects. Therefore, this mapping has to respect the unique 
name assumption (UNA), that is, if a, b are distinct names, then a 1 b x . 

The interpretation Z satisfies the concept assertion C(a) if a 1 £ C 1 , and it 
satisfies the role assertion R(a, b) if (a 1 , b T ) £ R 1 . An interpretation satisfies 
the ABox A if it satisfies each assertion in A. In this case we say that Z is a 
model of the assertion or of the ABox. Finally, Z satisfies an assertion a or 
an ABox A with respect to a TBox T if in addition to being a model of a 
or of A. it is a model of T . Thus, a model of A and T is an abstraction of a 
concrete world where the concepts are interpreted as subsets of the domain 
as required by the TBox and where the membership of the individuals to 
concepts and their relationships with one another in terms of roles respect 
the assertions in the ABox. 

2. 2. 3. 2 Individual names in the description language 

Sometimes, it is convenient to allow individual names (also called nomi- 
nals ) not only in the ABox, but also in the description language. Some 
concept constructors employing individuals occur in systems and have been 
investigated in the literature. The most basic one is the “set” (or one-of ) 
constructor, written 


{dr , . . . , > 

where ai,...,a n are individual names. As one would expect, such a set 
concept is interpreted as 

{oi, . . . ,o n } J = {a\,...,a x }. (2.8) 

With sets in the description language one can for instance define 
the concept of permanent members of the UN security council as 
{CHINA, FRANCE, RUSSIA, UK, USA}. 

In a language with the union constructor “U” , a constructor {a} for single- 
ton sets alone adds sufficient expressiveness to describe arbitrary finite sets 
since, according to the semantics of the set constructor in Equation (2.8), 
the concepts {ai, . . . , a n } and {gi} U • • • U {a n } are equivalent. 

Another constructor involving individual names is the “fills” constructor 


R : a 
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for a role R. The semantics of this constructor is defined as 

(R : a) 1 = {d € A 1 | (, d , a 1 ) e R 1 }, (2.9) 

that is, R : a stands for the set of those objects that have a as a filler of 
the role R. To a description language with singleton sets and full existen- 
tial quantification, “fills” does not add anything new, since Equation (2.9) 
implies that R : a and Eli?. {a} are equivalent. 

We note, finally, that “fills” allows one to express role assertions 
through concept assertions: an interpretation satisfies R(a, b ) iff it satisfies 
(3R.{b})(a). 


2.2.4 Inferences 

A knowledge representation system based on Description Logics is able to 
perform specific kinds of reasoning. As said before, the purpose of a knowl- 
edge representation system goes beyond storing concept definitions and as- 
sertions. A knowledge base - comprising TBox and ABox - has a semantics 
that makes it equivalent to a set of axioms in first-order predicate logic. 
Thus, like any other set of axioms, it contains implicit knowledge that can be 
made explicit through inferences. For example, from the TBox in Figure 2.2 
and the ABox in Figure 2.4 one can conclude that Mary is a grandmother, 
although this knowledge is not explicitly stated as an assertion. 

The different kinds of reasoning performed by a DL system (see Chap- 
ter 8) are defined as logical inferences. In the following, we shall discuss 
these inferences, first for concepts, then for TBoxes and ABoxes, and finally 
for TBoxes and ABoxes together. It will turn out that there is one main 
inference problem, namely the consistency check for ABoxes, to which all 
other inferences can be reduced. 

2. 2. 4-1 Reasoning tasks for concepts 

When modeling a domain, a knowledge engineer constructs a terminology, 
say T, by defining new concepts, possibly in terms of others that have been 
defined before. During this process, it is important to find out whether a 
newly defined concept makes sense or whether it is contradictory. From a 
logical point of view, a concept makes sense for us if there is some inter- 
pretation that satisfies the axioms of T (that is, a model of T) such that 
the concept denotes a nonempty set in that interpretation. A concept with 
this property is said to be satisfiable with respect to T and unsatisfiable 
otherwise. 
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Checking satisfiability of concepts is a key inference. As we shall see, 
a number of other important inferences for concepts can be reduced to 
(un) satisfiability. For instance, in order to check whether a domain model 
is correct, or to optimize queries that are formulated as concepts, we may 
want to know whether some concept is more general than another one: this 
is the subsumption problem. A concept C is subsumed by a concept D if in 
every model of X the set denoted by C is a subset of the set denoted by 
D. Algorithms that check subsumption are also employed to organize the 
concepts of a TBox in a taxonomy according to their generality. Further 
interesting relationships between concepts are equivalence and disjointness. 
These properties are formally defined as follows. Let X be a TBox. 

Satisfiability A concept C is satisfiable with respect to X if there exists a 
model X of T such that C x is nonempty. In this case we say also that 
X is a model of C. 

Subsumption A concept C is subsumed by a concept D with respect to X 
if C x C D x for every model X of X. In this case we write C C 7 - D or 

x != C E D. 

Equivalence Two concepts C and D are equivalent with respect to X if 
C x = D x for every model X of X. In this case we write C = 7 - D or 
X j= C = D. 

Disjointness Two concepts C and D are disjoint with respect to X if C x n 
D x = 0 for every model X of X. 

If the TBox X is clear from the context, we sometimes drop the qualification 
“with respect to X” . 

We also drop the qualification in the special case where the TBox is empty, 
and we simply write |= C C D if C is subsumed by D. and |= C = D if C 
and D are equivalent. 


Example 2.12 With respect to the TBox in Figure 2.2, Person subsumes 
Woman, both Woman and Parent subsume Mother, and Mother subsumes 
Grandmother. Moreover, Woman and Man, and Father and Mother are dis- 
joint. The subsumption relationships follow from the definitions because of 
the semantics of “n” and “U”. That Man is disjoint from Woman is due to 
the fact that Man is subsumed by the negation of Woman. ■ 


Traditionally, the basic reasoning mechanism provided by DL systems 
checked the subsumption of concepts. This, in fact, is sufficient to implement 
also the other inferences, as can be seen by the following reductions. 
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Proposition 2.13 (Reduction to Subsumption) For concepts C , D we 
have: 

1. C is unsatisfiable ^ C is subsumed by J_; 

2. C and D are equivalent ^ C is subsumed by D and D is subsumed by C ; 

3. C and D are disjoint -o- C n D is subsumed by _L. 

The statements also hold with respect to a TBox. 

All description languages implemented in actual DL systems provide the 
intersection operator “n” and almost all of them contain an unsatisfiable 
concept. Thus, most DL systems that can check subsumption can perform 
all four inferences defined above. 

If, in addition to intersection, a system allows one to form the negation 
of a description, one can reduce subsumption, equivalence, and disjointness 
of concepts to the satisfiability problem (see also Srnolka [1988]). 

Proposition 2.14 (Reduction to Unsatisfiability) For concepts C, D 
we have: 

1. C is subsumed by D 4$ C n ~^D is unsatisfiable; 

2. C and D are equivalent -o- both (C n -<D) and (-> C n D) are unsatisfiable; 

3. C and D are disjoint •<=> C n D is unsatisfiable. 

The statements also hold with respect to a TBox. 

The reduction of subsumption can easily be understood if one recalls that, 
for sets M, N, we have M C N iff M \ N = 0. The reduction of equivalence 
is correct because C and D are equivalent if, and only if, C is subsumed by 
D and D is subsumed by C. Finally, the reduction of disjointness is just a 
rephrasing of the definition. 

Because of the above proposition, in order to obtain decision procedures 
for any of the four inferences we have discussed, it is sufficient to develop 
algorithms that decide the satisfiability of concepts, provided the language 
for which we can decide satisfiability supports conjunction as well as negation 
of arbitrary concepts. 

In fact, this observation motivated researchers to study description lan- 
guages in which, for every concept, one can also form the negation of that 
concept [Srnolka, 1988; Schmidt-Schaufi and Srnolka, 1991; Donini et ai, 
1991b; 1997a]. The approach that considers satisfiability checking as the 
principal inference gave rise to a new kind of algorithms for reasoning in 
Description Logics, which can be understood as specialized tableau calculi 
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(see Section 2.3 in this chapter and Chapter 3). Also, the most recent gen- 
eration of DL systems, like Kris [Baader and Hollunder, 1991b], Crack 
[Bresciani et al., 1995], FaCT [Horrocks, 1998b], Dlp [Patel-Schneider, 1999], 
and Race [Haarslev and Moller, 2001e], are based on satisfiability checking, 
and a considerable amount of research work is spent on the development of 
efficient implementation techniques for this approach [Baader et al, 1994; 
Horrocks, 1998b; Horrocks and Patel-Schneider, 1999; Haarslev and Moller, 
2001c]. 

In an A£-language without full negation, subsumption and equivalence 
cannot be reduced to unsatisfiability in the simple way shown in Proposi- 
tion 2.14 and therefore these inferences may be of different complexity. 

As seen in Proposition 2.13, from the viewpoint of worst-case complexity, 
subsumption is the most general inference for any A£-language. The next 
proposition shows that unsatisfiability is a special case of each of the other 
problems. 

Proposition 2.15 (Reducing Unsat isfiability) LetC be a concept. Then 
the following are equivalent: 

1. C is unsatisfiable; 

2. C is subsumed by _L; 

3. C and _L are equivalent; 

4. C and T are disjoint. 

The statements also hold with respect to a TBox. 

From Propositions 2.13 and 2.15 we see that, in order to obtain upper 
and lower complexity bounds for inferences on concepts in A£-languages, 
it suffices to assess lower bounds for unsatisfiability and upper bounds for 
subsumption. More precisely, for each AT-language, an upper bound for the 
complexity of the subsumption problem is also an upper bound for the com- 
plexity of the unsatifiability, the equivalence, and the disjointness problem. 
Moreover, a lower bound for the complexity of the unsatifiability problem is 
also a lower bound for the complexity of the subsumption, the equivalence, 
and the disjointness problem. 

2. 2. f.2 Eliminating the TBox 

In applications, concepts usually come in the context of a TBox. However, 
for developing reasoning procedures it is conceptually easier to abstract from 
the TBox or, what amounts to the same, to assume that it is empty. 
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We show that, if T is an acyclic TBox, we can always reduce reasoning 
problems with respect to T to problems with respect to the empty TBox. 
As we have seen in Proposition 2.1, T is equivalent to its expansion T . 
Recall that in the expansion every definition is of the form A = D such that 
D contains only base symbols, not name symbols. Now, for each concept C 
we define the expansion of C with respect to T as the concept C that is 
obtained from C by replacing each occurrence of a name symbol A in C by 
the concept D, where A = D is the definition of A in T , the expansion of T. 
For example, we obtain the expansion of the concept 

Woman n Man (2.10) 

with respect to the TBox in Figure 2.2 by considering the expanded TBox 
in Figure 2.3, and replacing Woman and Man by the right-hand sides of their 
definitions in this expansion. This results in the concept 

Person n Female n Person n -i(Person n Female). (2-11) 

We can readily deduce a number of facts about expansions. Since the ex- 
pansion C' is obtained from C by replacing names by descriptions in such a 
way that both are interpreted in the same way in any model of T . it follows 
that: 

• C =r C'. 

Hence, C is satisfiable w.r.t. T iff C' is satisfiable w.r.t. T. However, C' con- 
tains no defined names, and thus C' is satisfiable w.r.t. T iff it is satisfiable. 
This yields that: 

• C is satisfiable w.r.t. T iff C' is satisfiable. 

If D is another concept, then we have also D =r D' . Thus, C C 7 - D iff 
C C 7 - D' , and C =r D iff C = 7 - D' . Again, since C' and D' contain only 
base symbols, this implies: 

. T^CQD iff \=C'QD'; 

• T\=C = D iff \= C' = D'. 

By similar arguments we can show that: 

• C and D are disjoint w.r.t. T iff C and D' are disjoint. 

Summing up, expanding concepts with respect to an acyclic TBox allows 
one to get rid of the TBox in reasoning problems. Going back to our example 
from above, this means that, in order to verify whether Man and Woman 
are disjoint with respect to the Family TBox, which amounts to checking 
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whether Man n Woman is unsatisfiable, it suffices to check that the concept 
(2.11) is unsatisfiable. 

Expanding concepts may be computationally costly, since in the worst 
case the size of T' is exponential in the size of T, and therefore C' may be 
larger than C by a factor that is exponential in the size of T . A complexity 
analysis of the difficulty of reasoning with respect to TBoxes shows that 
the expansion of definitions is a source of complexity that cannot always be 
avoided (see Subsection 2.3.3 of this chapter and Chapter 3). 

2. 2-4-3 Reasoning tasks for ABoxes 

After a knowledge engineer has designed a terminology and has used the 
reasoning services of the DL system to check that all concepts are satisfi- 
able and that the expected subsumption relationships hold, the ABox can be 
filled with assertions about individuals. We recall that an ABox contains two 
kinds of assertions, concept assertions of the form C(a) and role assertions of 
the form R{a, b). Of course, the representation of such knowledge has to be 
consistent, because otherwise - from the viewpoint of logic - one could draw 
arbitrary conclusions from it. If, for example, the ABox contains the asser- 
tions Mother(MARY) and Father(MARY), the system should be able to find 
out that, together with the Family TBox, these statements are inconsistent. 

In terms of our model-theoretic semantics we can easily give a formal 
definition of consistency. An ABox A is consistent with respect to a TBox 
T, if there is an interpretation that is a model of both A and T. We 
simply say that A is consistent if it is consistent with respect to the 
empty TBox. 

For example, the set of assertions {Mother(MARY), Father(MARY)} is con- 
sistent (with respect to the empty TBox), because without any further re- 
strictions on the interpretation of Mother and Father, the two concepts can 
be interpreted in such a way that they have a common element. However, 
the assertions are not consistent with respect to the Family TBox, since in 
every model of it, Mother and Father are interpreted as disjoint sets. 

Similarly as for concepts, checking the consistency of an ABox with respect 
to an acyclic TBox can be reduced to checking an expanded ABox. We define 
the expansion of A with respect to T as the ABox A' that is obtained from 
A by replacing each concept assertion C(a) in A by the assertion C'(a), 
where C' is the expansion of C with respect to T In every model of T, a 

5 We expand only concept assertions because the description language considered until now does 
not provide constructors for role descriptions and therefore we have not considered TBoxes with 
role definitions. If the description language is richer, and TBoxes contain also role definitions, 
then they clearly have to be taken into account in the definition of expansions. 
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concept C and its expansion C are interpreted in the same way. Therefore, 
A' is consistent w.r.t. T iff A is so. However, since A' does not contain a 
name symbol defined in T, it is consistent w.r.t. T iff it is consistent. We 
conclude: 

• A is consistent w.r.t. T iff its expansion A' is consistent. 

A technique to check the consistency of ATCAA ABoxes is discussed in Sub- 
section 2.3.2. 

Other inferences that we are going to introduce can also be defined with 
respect to a TBox or for an ABox alone. As in the case of consistency, rea- 
soning tasks for ABoxes with respect to acyclic TBoxes can be reduced to 
reasoning on expanded ABoxes. For the sake of simplicity, we shall give only 
definitions of inferences with ABoxes alone, and leave it to the reader to for- 
mulate the appropriate generalization to inferences with respect to TBoxes 
and to verify that they can be reduced to inferences about expansions, pro- 
vided the TBox is acyclic. 

Over an ABox A, one can pose queries about the relationships between 
concepts, roles and individuals. The prototypical ABox inference on which 
such queries are based is instance checking , or the check whether an assertion 
is entailed by an ABox. We say that an assertion a is entailed by A and we 
write A |= a, if every interpretation that satisfies A, that is, every model of 
A, also satisfies a. If a is a role assertion, instance checking is easy, since our 
description language does not contain constructors to form complex roles. 
If a is of the form C'(a), we can reduce instance checking to the consistency 
problem for ABoxes because there is the following connection: 

• A |= C(a) iff dll {->(7(0)} is inconsistent. 

Also reasoning about concepts can be reduced to consistency checking. We 
have seen in Proposition 2.14 that the important reasoning problems for con- 
cepts can be reduced to that of deciding whether a concept is (un)satisfiable. 
Similarly, concept satisfiability can be reduced to ABox consistency because 
for every concept C we have: 

• C is satisfiable iff {C(a)} is consistent, 

where a is an arbitrarily chosen individual name. Conversely, Schaerf [1994b] 
has shown that ABox consistency can be reduced to concept satisfiability 
in languages with the “set” and the “fills” constructor. If these constructors 
are not available, however, then instance checking may be harder than the 
satisfiability and the subsumption problem [Donini et al, 1994b]. 
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For applications, more complex inferences than consistency and instance 
checking are usually required. If we consider a knowledge base as a means 
to store information about individuals, we may want to know all individu- 
als that are instances of a given concept description C, that is, we use the 
description language to formulate queries. In our example, we may want 
to know from the system all parents that have at least two children - for 
instance, because they are entitled to a specific family tax break. The re- 
trieval problem is, given an ABox A and a concept C, to find all individuals 
a such that A \= C(a) . A non-optimized algorithm for a retrieval query can 
be realized by testing for each individual occurring in the ABox whether it 
is an instance of the query concept C. 

The dual inference to retrieval is the realization problem : given an indi- 
vidual a and a set of concepts, find the most specific concepts C from the 
set such that A (= C(a). Here, the most specific concepts are those that are 
minimal with respect to the subsumption ordering C. Realization can, for 
instance, be used in systems that generate natural language if terms are 
indexed by concepts and if a term as precise as possible is to be found for 
an object occurring in a discourse. 

2 . 2 . 4-4 Closed- vs. open-world semantics 

Often, an analogy is established between databases on the one hand and 
DL knowledge bases on the other hand (see also Chapter 16). The schema 
of a database is compared to the TBox and the database instance with the 
actual data is compared to the ABox. However, the semantics of ABoxes 
differs from the usual semantics of database instances. While a database 
instance represents exactly one interpretation, namely the one where classes 
and relations in the schema are interpreted by the objects and tuples in the 
instance, an ABox represents many different interpretations, namely all its 
models. As a consequence, absence of information in a database instance 
is interpreted as negative information, while absence of information in an 
ABox only indicates lack of knowledge. 

For example, if the only assertion about Peter is hasChild(PETER, HARRY), 
then in a database this is understood as a representation of the fact that 
Peter has only one child, Harry. In an ABox, the assertion only expresses 
that, in fact, Harry is a child of Peter. However, the ABox has several 
models, some in which Harry is the only child and others in which he has 
brothers or sisters. Consequently, even if one also knows (by an assertion) 
that Harry is male, one cannot deduce that all of Peter’s children are male. 
The only way of stating in an ABox that Harry is the only child is by 
doing so explicitly, that is by adding the assertion (^ 1 hasChild)(PETER). 
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hasChild(JOCASTA, OEDIPUS) hasChild(JOCASTA, POLYNEIKES) 

hasChildfOEDIPUS. POLYNEIKES) hasChild) POLYNEIKES, THERSANDROS) 
Patricide(OEDIPUS) ^Patricide(THERSANDROS) 

Fig. 2.5. The Oedipus ABox A oe - 

This means that, while the information in a database is always understood 
to be complete, the information in an ABox is in general viewed as being 
incomplete. The semantics of ABoxes is therefore sometimes characterized 
as an “open- world” semantics, while the traditional semantics of databases 
is characterized as a “closed- world” semantics. 

This view has consequences for the way queries are answered. Essentially, 
a query is a description of a class of objects. In our setting, we assume that 
queries are concept descriptions. A database (in the sense introduced above) 
is a listing of a single finite interpretation. A finite interpretation, say X, 
could be written up as a set of assertions of the form A(a) and R(b, c), where 
A is an atomic concept and R an atomic role. Siich a set looks syntactically 
like an ABox, but is not an ABox becaiise of the difference in semantics. 
Answering a query, represented by a complex concept C. over that database 
amounts to computing C x as it was defined in Subsection 2.2.1. From a 
logical point of view this means that query evaluation in a database is not 
logical reasoning, but finite model checking (i.e. , evaluation of a formula in 
a fixed finite model). 

Since an ABox represents possibly infinitely many interpretations, namely 
its models, query answering is more complex: it requires nontrivial reasoning. 
Here we are only concerned with semantical issues (algorithmic aspects will 
be treated in Section 2.3). To illustrate the difference between a semantics 
that identifies a database with a single model, and the open-world semantics 
of ABoxes, we discuss the so-called Oedipus example, which has stimulated 
a number of theoretical developments in DL research. 

Example 2.16 The example is based on the Oedipus story from ancient 
Greek mythology. In a nutshell, the story recounts how Oedipus killed his 
father, married his mother Jocasta, and had children with her, among them 
Polyneikes. Finally, Polyneikes also had children, among them Thersandros. 

We suppose the ABox A oe in Figure 2.5 represents some rudimentary 
facts about these events. For the sake of the example, our ABox asserts that 
Oedipus is a patricide and that Thersandros is not, which is represented 
using the atomic concept Patricide. 

Suppose now that we want to know from the ABox whether Jocasta has 
a child that is a patricide and that itself has a child that is not a patricide. 
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This can be expressed as the entailment problem 

A oe |= (3hasChild. (Patricide n 3hasChild.-iPatricide))(JOCASTA) ? 

One may be tempted to reason as follows. Jocasta has two children in the 
ABox. One, Oedipus, is a patricide. He has one child, Polyneikes. But noth- 
ing tells us that Polyneikes is not a patricide. So, Oedipus is not the child 
we are looking for. The other child is Polyneikes, but again, nothing tells 
us that Polyneikes is a patricide. So, Polyneikes is also not the child we 
are looking for. Based on this reasoning, one would claim that the assertion 
about Jocasta is not entailed. 

However, the correct reasoning is different. All the models of A oe can be 
divided into two classes, one in which Polyneikes is a patricide, and another 
one in which he is not. In a model of the first kind, Polyneikes is the child 
of Jocasta that is a patricide and has a child, namely Thersandros, that 
isn’t. In a model of the second kind, Oedipus is the child of Jocasta that is a 
patricide and has a child, namely Polyneikes, that isn’t. Thus, in all models 
Jocasta has a child that is a patricide and that itself has a child that is not 
a patricide (though this is not always the same child). This means that the 
assertion (HhasChild. (Patricide n 3hasChild.-iPatricide))(JOCASTA) is indeed 
entailed by A oc ■ ■ 

As this example shows, open- world reasoning may require case analyses. As 
will be explained in more detail in Chapter 3, this is one of the reasons 
why inferences in Description Logics are often more complex than query 
answering in databases. 


2.2.5 Rules 

The knowledge bases we have discussed so far consist of a TBox T and an 
ABox A. We denote such a knowledge base as a pair fC = (T, A). 

In some DL systems, such as Classic [Brachman et al, 1991] or Loom 
[MacGregor, 1991a], in addition to terminologies and world descriptions, one 
can also use rules to express knowledge. The simplest variant of such rules 
is an expression of the form 

D, 

where C. D are concepts. The meaning of such a rule is “if an individual is 
proved to be an instance of C, then derive that it is also an instance of D n . 
Such rules are often called trigger rules. 
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Operationally, the semantics of a finite set 1Z of trigger rules can be de- 
scribed by a forward reasoning process. Starting with an initial knowledge 
base 1C, a series of knowledge bases . . . is constructed, where 

/C(°) = 1C and /C( i+1 ) is obtained from 1C ^ by adding a new assertion D(a ) 
whenever 1Z contains a rule C => D such that /C^ |= C(a) holds, but 1C^ 
does not contain D(a). This process eventually halts because the initial 
knowledge base contains only finitely many individuals and there are only 
finitely many rules. Hence, there are only finitely many assertions D{a) that 
can possibly be added. The result of the rule applications is a knowledge 
base /C( n ) that has the same TBox as IC^l and whose ABox is augmented 
by the membership assertions introduced by the rules. We call this final 
knowledge base the procedural extension of 1C and denote it by 1C. It is easy 
to see that this procedural extension is independent of the order of rule ap- 
plications. Consequently, a set of trigger rules 1Z uniquely specifies how to 
generate, for each knowledge base 1C, an extended knowledge base 1C. The 
semantics of a knowledge base 1C, augmented by a set of trigger rules, can 
thus be understood as the set of models of 1C. 

This defines the semantics of trigger rules only operationally. It would 
be preferable to specify the semantics declaratively and then to prove that 
the extension computed with the trigger rules correctly represents this se- 
mantics. It might be tempting to use the declarative semantics of inclusion 
axioms as semantics for rules. However, this does not correctly reflect the 
operational semantics given above. An important difference between the 
trigger rule C =$■ D and the inclusion axiom C C D is that the trigger rule 
is not equivalent to its contrapositive —>D =>• —>C. In addition, when applying 
trigger rules one does not make a case analysis. For example, the inclusions 
CCD and —>C C D imply that every object belongs to D, whereas neither 
of the trigger rules C =>• D and —>C => D applies to an individual a for which 
neither C(a) nor -i C(a) can be proven. 

In order to capture the meaning of trigger rules in a declarative way, we 
must augment Description Logics by an operator K, which does not refer 
to objects in the domain, but to what the knowledge base knows about 
the domain. Therefore, K is an epistemic operator. More information on 
epistemic operators in Description Logics can be found in Chapter 6. 

To introduce the K operator, we enrich both the syntax and the seman- 
tics of description languages. Originally, the K operator was defined for 
ACC [Donini et al, 1992b; 1998a]. In this subsection, we discuss only how 
to extend the basic language AC. For other languages, one can proceed 
analogously (see also Chapter 6). 
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First, we add one case to the syntax rule in Subsection 2. 2. 1.1 that allows 
us to construct epistemic concepts: 

C,D — > KC (epistemic concept). 

Intuitively, the concept KC 1 denotes those objects for which the knowledge 
base knows that they are instances of C . 

Next, using K, we translate trigger rules C =>• D into inclusion axioms 

K C C D. (2.12) 

Intuitively, the K operator in front of the concept C has the effect that 
the axiom is only applicable to individuals that appear in the ABox and 
for which ABox and TBox imply that they are instances of C. Such a re- 
stricted applicability prevents the inclusion axiom from influencing satisfi- 
ability or subsumption relationships between concepts. In the sequel, we 
will define a formal semantics for the operator K that has exactly this 
effect. 

A rule knowledge base is a triple /C = ( T,A,1Z ), where T is a TBox, 
A is an ABox, and 1Z is a set of rules written as inclusion axioms of the 
form (2.12). The procedural extension of such a triple is the knowledge base 
1C = (T,A) that is obtained from (T, A) by applying the trigger rules as 
described above. 

The semantics of epistemic inclusions will be defined in such a way that it 
applies only to individuals in the knowledge base that provably are instances 
of C, but not to arbitrary domain elements, which would be the case if 
we dropped K. The semantics will go beyond first-order logic because we 
not only have to interpret concepts, roles and individuals, but also have to 
model the knowledge of a knowledge base. The fact that a knowledge base 
has knowledge about the domain can be understood in such a way that it 
considers only a subset W of the set of all interpretations as possible states 
of the world. Those individuals that are interpreted as elements of C under 
all interpretations in W are then “known” to be in C. 

To make this formal, we modify the definition of ordinary (first-order) 
interpretations by assuming that: 

1. there is a fixed countably infinite set A that is the domain of every interpreta- 
tion (common domain assumption); 

2. there is a mapping 7 from the individuals to the domain elements that fixes 
the way individuals are interpreted (rigid term assumption). 

The common domain assumption guarantees that all interpretations speak 
about the same domain. The rigid term assumption allows us to identify 
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each individual symbol with exactly one domain element. These assump- 
tions do not essentially reduce the number of possible interpretations. As a 
consequence, properties like satisfiability and subsumption of concepts are 
the same independently of whether we define them with respect to arbitrary 
interpretations or those that satisfy the above assumptions. 

Now, we define an epistemic interpretation as a pair [I, W), where X 
is a first-order interpretation and W is a set of first-order interpretations, 
all satisfying the above assumptions. Every epistemic interpretation gives 
rise to a unique mapping X)VS> associating concepts and roles with subsets 
of A and A x A, respectively. For T, _L, atomic concepts, negated atomic 
concepts, and atomic roles, x ^ agrees with x . For intersections, value re- 
strictions, and existential quantifications, the definition is similar to that 
of - x : 

(CnDf> w = C ,x ’ w n D iyv 

(' VR.C) 1 ’™ = {a e A | V 6 . (a, b) E R X ' W -»• b e C x ’ w } 

(3R. T) x ’ w = {a e A | 3b. (a, b) G R xysJ }. 

For other constructors, x ^ can be defined analogously. Note that for a 
concept C without an occurrence of K, the sets C x,w and C x are identical. 
The set of interpretations W comes into play when we define the semantics 
of the epistemic operator: 

(Ko x ’ w = p| c j w . 

J&M 

It would also be possible to allow the operator K to occur in front of roles 
and to define the semantics of role expressions of the form K R analogously. 
However, since epistemic roles are not needed to explain the semantics of 
rules, we restrict ourselves to epistemic concepts. 

An epistemic interpretation (X, W) satisfies an inclusion C C D if C x,w C 
D 1 ^ , and an equality C = D if C' x,w = D x ^ v . It satisfies an assertion 
(7(a) if a x,v V = 7 (a) £ C I,W , and an assertion R(a,b) if (a X ' W , b 1 ^) = 
( 7 (a), 7 ( 6 )) E R X ' W . It satisfies a rule knowledge base /C = (T,A,TZ) if it 
satisfies every axiom in X, every assertion in A, and every rule in 77. 

An epistemic model for a rule knowledge base /C is a maximal nonempty 
set W of first-order interpretations such that, for each X E W, the epistemic 
interpretation (X, W) satisfies 1C. 

Note that, if (T,A) is first-order satisfiable, then the set of all first-order 
models of (X, A) is the only epistemic model of the rule knowledge base 
tC = (X, A, 0), whose rule set is empty. A similar statement holds for arbi- 
trary rule knowledge bases. One can show that, if Wi and W -2 are epistemic 
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models, then the union W\ U W 2 is one, too, which implies Wi = W 2 because 
of the maximality of epistemic models. 

Proposition 2.17 Let tC = [T,A,1Z) be a rule knowledge base such that 
[T,A) is first-order satisfiable. Then K, has a unique epistemic model. 

Example 2.18 Let 1Z consist of the rule 

KStudent C Veats.JunkFood. (2.13) 

The rule states that “those individuals that are known to be students eat 
only junk food” . 

We consider the rule knowledge base K, 1 = (0, , 72.) , where 

Ai = {Student( PETER)}. 

Let us determine the epistemic model W of tC\. Every first-order inter- 
pretation IgW must satisfy A\ . Therefore, in every such X, we have that 
Student(PETER) is true, and thus Peter is known to be a student. Since W sat- 
isfies Rule (2.13), the assertion Veats.JunkFood(PETER) also holds in every X. 

For any other domain element a £ A, there is at least one interpretation 
in W where a is not a student. Thus, Peter is the only domain element 
to which the rule applies. Summing up, the epistemic model of consists 
exactly of the first-order models of A\ U {Veats.JunkFood(PETER)}. 

Next we demonstrate with this example that the epistemic semantics for 
rules disallows contrapositive reasoning. We consider the rule knowledge 
base IC 2 = (0, A 2 ,TZ), where 

A2 = {-iVeats.JunkFood(PETER)}. 

In this case, -iVeats.JunkFood(PETER) is true in every first-order interpreta- 
tion of the epistemic model W. However, because of the maximality of W, 
there is at least one interpretation in W in which Peter is a student and an- 
other one where Peter is not a student. Therefore, Peter is not known to be 
a student. Thus, the epistemic model of /C 2 consists exactly of the first-order 
models of A 2 - The rule is satisfied because the antecedent is false. ■ 

Clearly, the procedural extension of a rule knowledge base K, contains 
only assertions that must be satisfied by the epistemic model of 1C. It can 
be shown that the assertions added to K, by the rule applications are in 
fact, as stated in the following proposition, a first-order representation of 
the information that is implicit in the rules (see [Donini et al, 1998a] for a 
proof). 
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Proposition 2.19 Let 1C = (T,A,1Z) be a rule knowledge base. If{T,A) is 
first-order satisfiable, then the epistemic model of fC consists precisely of the 
first-order models of the procedural extension 1C = (T ,A). 


2.3 Reasoning algorithms 

In Subsection 2.2.4 we have seen that all the relevant inference problems 
can be reduced to the consistency problem for ABoxes, provided that the 
Description Logic at hand allows conjunction and negation. However, the 
description languages of all the early and also of some of the present day 
DL systems do not allow negation. For such Description Logics, subsump- 
tion of concepts can usually be computed by so-called structural subsump- 
tion algorithms, i.e. , algorithms that compare the syntactic structure of 
(possibly normalized) concept descriptions. In the first subsection, we will 
consider such algorithms in more detail. While they are usually very ef- 
ficient, they are only complete for rather simple languages with little ex- 
pressivity. In particular, Description Logics with (full) negation and dis- 
junction cannot be handled by structural subsumption algorithms. For 
such languages, so-called tableau-based algorithms have turned out to be 
very useful. In the area of Description Logics, the first tableau-based al- 
gorithm was presented by Schmidt-Schaufi and Srnolka [1991] for satisfi- 
ability of A£C-concepts. Since then, this approach has been employed to 
obtain sound and complete satisfiability (and thus also subsumption) algo- 
rithms for a great variety of Description Logics extending ACC (see, e.g., 
[Hollunder et ai, 1990; Hollunder and Baader, 1991a; Donini et ai, 1997a; 
Baader and Sattler, 1999] for languages with number restrictions; [Baader, 
1991] for transitive closure of roles and [Sattler, 1996; Horrocks and Sattler, 
1999] for transitive roles; and [Baader and Hanschke, 1991a; Hanschke, 1992; 
Haarslev et ai, 1999] for constructors that allow one to refer to concrete 
domains such as numbers). In addition, it has been extended to the consis- 
tency problem for ABoxes [Hollunder, 1990; Baader and Hollunder, 1991b; 
Donini et ai, 1994b; Haarslev and Moller, 2000], and to TBoxes allow- 
ing general sets of inclusion axioms and more [Buchheit et ai, 1993a; 
Baader et ai, 1996]. In the second subsection, we will first present a tableau- 
based satisfiability algorithm for „4£CA7-concepts, then show how it can be 
extended to an algorithm for the consistency problem for ABoxes, and fi- 
nally explain how general inclusion axioms can be taken into account. The 
third subsection is concerned with reasoning w.r.t. acyclic and cyclic termi- 
nologies. 
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Instead of designing new algorithms for reasoning in Description 
Logics, one can also try to reduce the problem to a known inference 
problem in logics (see also Chapter 4). For example, decidability of the 
inference problems for ACC and many other Description Logics can be 
obtained as a consequence of the known decidability result for the two- 
variable fragment of first-order predicate logic. The language C 2 consists 
of all formulae of first-order predicate logic that can be built with the 
help of predicate symbols (including equality) and constant symbols (but 
without function symbols) using only the variables x, y. Decidability of 
C 2 has been shown in [Mortimer, 1975]. It is easy to see that, by ap- 
propriately re-using variable names, any concept description of the lan- 
guage ACC can be translated into an £ 2 - formula with one free variable (see 
[Borgida, 1996] for details). A direct translation of the concept description 
VR.(3R.A) yields the formula \/y.(R(x, y) — > (3z.(R(y, z) A A(z)))). Since 
the subformula 3z.(R{y, z) A A(z)) does not contain x , this variable can 
be re-used: renaming the bound variable z to x yields the equivalent for- 
mula \/y.(R(x,y) — > (3x.(R(y, x) A A(x)))), which uses only two variables. 
This connection between ACC and C 2 shows that any extension of ACC 
by constructors that can be expressed with the help of only two variables 
yields a decidable Description Logic. Number restrictions and composition 
of roles are examples of constructors that cannot be expressed within C 2 . 
Number restrictions can, however, be expressed in C 2 , the extension of C 2 by 
counting quantifiers, which has recently been shown to be decidable [Gradel 
et al, 1997b; Pacholski et al, 1997]. It should be noted, however, that the 
complexity of the decision procedures obtained this way is usually higher 
than necessary: for example, the satisfiability problem for C 2 is NExp- 
TiME-complete, whereas satisfiability of ACC- concept descriptions is “only” 
PSPACE-complete. 

Decision procedures with lower complexity can be obtained by using 
the connection between Description Logics and propositional modal logics. 
Schild [1991] was the first to observe that the language ACC is a syntactic 
variant of the propositional multi-modal logic K, and that the extension of 
ACC by transitive closure of roles [Baader, 1991] corresponds to Proposi- 
tional Dynamic Logic (pdl). In particular, some of the algorithms used in 
propositional modal logics for deciding satisfiability are very similar to the 
tableau-based algorithms newly developed for Description Logics. This con- 
nection between Description Logics and modal logics has been used to trans- 
fer decidability results from modal logics to Description Logics [Schild, 1993; 
1994; De Giacomo and Lenzerini, 1994a; 1994b] (see also Chapter 5). In- 
stead of using tableau-based algorithms, decidability of certain propositional 
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modal logics (and thus of the corresponding Description Logics) can also be 
shown by establishing the finite model property (see, e.g., [Fitting, 1993], 
Section 1.14) of the logic (i.e., showing that a formula or concept is sat- 
isfiable iff it is satisfiable in a finite interpretation) or by employing tree 
automata (see, e.g., [Vardi and Wolper, 1986]). 


2.3.1 Structural subsumption algorithms 

These algorithms usually proceed in two phases. First, the descriptions to 
be tested for subsumption are normalized, and then the syntactic structure 
of the normal forms is compared. For simplicity, we first explain the ideas 
underlying this approach for the small language T Co, which allows con- 
junction (C n D) and value restrictions ( MR.C ). Subsequently, we show how 
the bottom concept (_L), atomic negation (-A), and number restrictions 
(^ nR and ^ nR ) can be handled. Evidently, LF Co and its extension by 
bottom and atomic negation are sublanguages of AC, while adding number 
restrictions to the resulting language yields the Description Logic ACM . 
An T Co-concept description is in normal form iff it is of the form 

Ai n • • • n A m n \IR X .C X n • • • n \/R n .C n , 

where A\, . . . , A m are distinct concept names, R\, . . . , R n are distinct role 
names, and C\, . . . , C n are J-"£o- conce Pt descriptions in normal form. It is 
easy to see that any description can be transformed into an equivalent one 
in normal form, using associativity, commutativity and idempotence of n, 
and the fact that the descriptions \/R.(C n D) and (VR.C) n (VR.D) are 
equivalent. 

Proposition 2.20 Let 

Ai n • • • n A m n V-RiA n • • • n VR n .C n 

be the normal form of the F Cq- concept description C , and 
Bin---r\B k n \/Si.D 1 n • • • n VS*. A 

the normal form of the F Co-concept description D. Then C C D iff the 
following two conditions hold: 

1. for all i, 1 < i < k, there exists j, 1 < j < m such that Bi = Aj; 

2. for all i, 1 < i < l, there exists j, 1 < j < n such that Si = Rj and Cj C 

It is easy to see that this characterization of subsumption is sound (i.e., 
the “if” direction of the proposition holds) and complete (i.e., the “only if” 
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direction of the proposition holds as well). This characterization yields an 
obvious recursive algorithm for computing subsumption, which can easily 
be shown to be of polynomial time complexity [Levesque and Brachman, 
1987]. 

If we extend TjCq by language constructors that can express unsatisfi- 
able concepts, then we must, on the one hand, change the definition of the 
normal form. On the other hand, the structural comparison of the normal 
forms must take into account that an unsatisfiable concept is subsumed by 
every concept. The simplest Description Logic where this occurs is J-C j_, 
the extension of RLq by the bottom concept _L. 

An VLCx-concept description is in normal form iff it is _L or of the form 

Ai n • • • n A m n Vi?i.Ci n • • • n VR n .C n , 

where A\, , A m are distinct concept names different from _L, R±, , R n 
are distinct role names, and C\, ... , C n are iTC^-concept descriptions in 
normal form. Again, such a normal form can easily be computed. In prin- 
ciple, one just computes the ■i r £o- norma l form of the description (where 
_L is treated as an ordinary concept name): B\ n • • • n B f. n \/R\.D\ n • • • n 
\/R n .D n . If one of the Bj.s is _L, then replace the whole description by _L. 
Otherwise, apply the same procedure recursively to the Dj s. For example, 
the JF£ 0 -norrnal form of Vi?.Vi?.L? nil! Vi?. (A n Vi?._L) is 

A n Vi?. (A n Vi?. (5 n T)), 
which yields the .i-X^-normal form 

AnVi?.(AnVi?.T). 

The structural subsumption algorithm for works just like the one for 
TC o, with the only difference that _L is subsumed by any description. For 
example, Vi?.Vi?.L> n A n Vi?. (A n Vi?._L) C Vi?.Vi?.A n A n Vi?. A since the 
recursive comparison of their JAC^-normal forms A n Vi?. (A n Vi?._L) and 
A n Vi?. (A n Vi?. A) finally leads to the comparison of _L and A. 

The extension of J 7 C± by atomic negation (i.e., negation applied to 
concept names only) can be treated similarly. During the computation 
of the normal form, negated concept names are just treated like concept 
names. If, however, a name and its negation occur on the same level of 
the normal form, then _L is added, which can then be treated as described 
above. For example, Vi?.-iA n A n Vi?.(A n Vi?.L?) is first transformed into 
An Vi?. (An -.An Vi?. 5), then into A n Vi?.(_L n A n -A n VR.B), and fi- 
nally into A n Vi?._L. The structural comparison of the normal forms treats 
negated concept names just like concept names. 
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Finally, if we consider the language ACM , the additional presence of num- 
ber restrictions leads to a new type of conflict. On the one hand, as in the 
case of atomic negation, number restrictions may be in conflict with each 
other (e.g., ^ 2 R and ^ 1 R). On the other hand, at-least restrictions ^ nR 
for n > 1 are in conflict with value restrictions Vi?._L that prohibit role 
successors. When computing the normal form, one can again treat num- 
ber restrictions like concept names, and then take care of the new types 
of conflicts by introducing _L and using it for normalization as described 
above. During the structural comparison of normal forms, one must also 
take into account inherent subsumption relationships between number re- 
strictions (e.g., ^ n R C ^ m R iff n > m). A more detailed description of a 
structural subsumption algorithm working on a graph-like data structure for 
a language extending ACM can be found in [Borgida and Patel-Schneider, 
1994], 

For larger Description Logics, structural subsumption algorithms usually 
fail to be complete. In particular, they cannot treat disjunction, full negation, 
and full existential restriction 3R.C. For languages including these construc- 
tors, the tableau approach to designing subsumption algorithms has turned 
out to be quite useful. 


2.3.2 Tableau algorithms 

Instead of directly testing subsumption of concept descriptions, these algo- 
rithms use negation to reduce subsumption to (un)satisfiability of concept 
descriptions: as we have seen in Subsection 2.2.4, C C D iff C n —>D is un- 
satisfiable. 

Before describing a tableau-based satisfiability algorithm for ACCM in 
more detail, we illustrate the underlying ideas by two simple examples. Let 
A, B be concept names, and let R be a role name. 

As a first example, assume that we want to know whether (3R. A) n 
(3R.B) is subsumed by 3 R.(A n B). This means that we must check whether 
the concept description 

C = (3 R.A) n (3 R.B) n -.(3 R.(A n B)) 

is unsatisfiable. 

First, we push all negation signs as far as possible into the description, 
using De Morgan’s rules and the usual rules for quantifiers. As a result, we 
obtain the description 

C 0 = (3 R.A) n (3 R.B) n \/R.(->A U 
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which is in negation normal form, i.e., negation occurs only in front of con- 
cept names. 

Then, we try to construct a finite interpretation T such that C x 7^ 0. This 
means that there must exist an individual in A x that is an element of Cq. 

The algorithm just generates such an individual, say b, and imposes the 
constraint b G Cq on it. Since Co is the conjunction of three concept de- 
scriptions, this means that b must satisfy the following three constraints: 
b G (3d?.A) x , b G (3R.B) 1 , and b G ( Vd?.(^A U -,B)) X . 

From b G (3d?.A) x we can deduce that there must exist an individual 
c such that (6, c) G R 1 and c G A 1 . Analogously, b G ( 3R.B ) x implies the 
existence of an individual d with ( b , d) G R x and d G B x . In this situation, 
one should not assume that c = d since this would possibly impose too many 
constraints on the individuals newly introduced to satisfy the existential 
restrictions on b. Thus: 

• For any existential restriction the algorithm introduces a new individual as role 
filler, and this individual must satisfy the constraints expressed by the restriction. 

Since b must also satisfy the value restriction Vd?.(^A U ->5), and c, d 
were introduced as d?-fillers of b, we obtain the additional constraints c G 
(-1 A LI -1 B) x and d G ip A LI — 1 B) x . Thus: 

• The algorithm uses value restrictions in interaction with already defined role re- 
lationships to impose new constraints on individuals. 

Now c G (-1 A U -i B) x means that c G (-iA) x or c G (-1 B) x , and we must 
choose one of these possibilities. If we assume c G (-1 A) x , this clashes with 
the other constraint c G A 1 , which means that this search path leads to 
an obvious contradiction. Thus we must choose c G (~<B) X . Analogously, we 
must choose d G (-iA) x in order to satisfy the constraint d G ip A U -1 B) x 
without creating a contradiction to d G B x . Thus: 

• For disjunctive constraints, the algorithm tries both possibilities in successive at- 
tempts. It must backtrack if it reaches an obvious contradiction, i.e., if the same 
individual must satisfy constraints that are obviously conflicting. 

In the example, we have now satisfied all the constraints without en- 
countering an obvious contradiction. This shows that Co is satisfiable, and 
thus (3R.A) n (3R.B) is not subsumed by 3d?.(A n B ). The algorithm has 
generated an interpretation 1 as witness for this fact: A x = {b, c, d}; R 1 = 
{( b , c), (b, d)}; A 1 = {c} and B x = {d}. For this interpretation, b G Cq. This 
means that b G ((3d?. A) n (3d?. id)) 2 ', but b fL (3d?. (A n B)) x . 

In our second example, we add a number restriction to the first concept of 
the above example, i.e., we now want to know whether (3d?. A) n (3d?.dd) n 
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^ 1 R is subsumed by 3R.(A\1 B). Intuitively, the answer should now be 
“yes” since ^ 1 R in the first concept ensures that the i?-filler in A coincides 
with the R- filler in B, and thus there is an R- filler in An B. The tableau- 
based satisfiability algorithm first proceeds as above, with the only difference 
that there is the additional constraint b £ 1 R) x . In order to satisfy this 

constraint, the two .R-fillers c,d of b must be identified with each other. 
Thus: 

• If an at-most number restriction is violated then the algorithm must identify dif- 
ferent role fillers. 

In the example, the individual c = d must belong to both A 1 and B 1 , 
which together with c = d£ (-A U -> B) x always leads to a clash. Thus, the 
search for a counterexample to the subsumption relationship fails, and the 
algorithm concludes that ( 3R.A ) n (3R.B) n ^ 1 R C 3R.(A n B). 

2.3.2. 1 A tableau-based satisfiability algorithm for ACCN 
Before we can describe the algorithm more formally, we need to introduce 
an appropriate data structure in which to represent constraints like “a be- 
longs to (the interpretation of) C” and “6 is an /?,-filler of a” . The original 
paper by Schmidt-Schaufi and Srnolka [1991], and also many other papers 
on tableau algorithms for Description Logics, introduce the new notion of 
a constraint system for this purpose. However, if we look at the types of 
constraints that must be expressed, we see that they can actually be repre- 
sented by ABox assertions. As we have seen in the second example above, 
the presence of at-most number restrictions may lead to the identification 
of different individual names. For this reason, we will not impose the unique 
name assumption on the ABoxes considered by the algorithm. Instead, we 
allow explicit inequality assertions of the form x y for individual names 
x, y. with the obvious semantics that an interpretation 1 satisfies x y iff 
x 1 y 1 . These assertions are assumed to be symmetric, i.e. , saying that 

x y belongs to an ABox A is the same as saying that y A x belongs to A. 

Let Co by an ATCAAconcept in negation normal form. In order to test 
satisfiability of Co, the algorithm starts with the ABox Ao = {Co(xo)}, and 
applies consistency-preserving transformation rules (see Figure 2.6) to the 
ABox until no more rules apply. If the “complete” ABox obtained this way 
does not contain an obvious contradiction (called clash), then Ao is consis- 
tent (and thus Co is satisfiable) , and inconsistent (unsatisfiable) otherwise. 
The transformation rules that handle disjunction and at-most restrictions 
are non- deterministic in the sense that a given ABox is transformed into 
finitely many new ABoxes such that the original ABox is consistent iff one 
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The — > n -rule 

Condition A contains (Ci n C 2 )(x), but it does not contain both Ci(x) and C 2 (x). 

Action A' = A U {C\(x), C 2 {x)}. 

The — > u -rule 

Condition A contains (Ci U but neither Ci(®) nor C 2 (x). 

Action A' = A U {Ci(3")}, A" = A U {C 2 (x)}. 

The — >3-rule 

Condition A contains ( BR.C)(x ), but there is no individual name * such that C(z) 

and R{x,z) are in A. 

Action A! = A U {C(y),R(x, y)} where y is an individual name not occurring in A. 

The — >y-rule 

Condition A contains (WR.C)(x) and R(x,y ), but it does not contain C{y). 

Action A! = A U {C(y)}. 

The — >>-rule 

Condition A contains n R){x), and there are no individual names zi. ... ,z n such 
that R(x, Zi) (1 < i < n) and Zi jk Zj (1 < i < j < n) are contained in A. 

Action A' = A U {R(x, yi) | 1 < i < n} U {yi ^ yj \ 1 < i < j < n}, where yi,...,y n 
are distinct individual names not occurring in A. 

The — ><-rule 

Condition A contains distinct individual names yi, . . . , y n +i such that n R)(x) 
and R{x, yi ), . . . , R(x, y n + i) are in A, and yi ^ % is not in A for some i ^ j. 

Action For each pair yt,yj such that i > j and yi ^ yj is not in .A, the ABox 
Aij — [yi/yj]A is obtained from A by replacing each occurrence of yt by yj. 


Fig. 2.6. Transformation rules of the satisfiability algorithm. 

of the new ABoxes is so. For this reason we will consider finite sets of 
ABoxes S = {A\ . . . . , instead of single ABoxes. Such a set is consistent 
iff there is some i, 1 < i < k, such that Ai is consistent. A rule of Figure 2.6 
is applied to a given finite set of ABoxes S as follows: it takes an element 
A of S, and replaces it by one ABox A', by two ABoxes A and A\ or by 
finitely many ABoxes Ai j. 

The following lemma is an easy consequence of the definition of the trans- 
formation rules: 


Lemma 2.21 (Soundness) Assume that S' is obtained from the finite set 
of ABoxes S by application of a transformation rule. Then S is consistent 
iff S' is consistent. 

The second important property of the set of transformation rules is that 
the transformation process always terminates: 


Lemma 2.22 (Termination) Let Co be an ACC M -concept description 
in negation normal form. There cannot be an infinite sequence of rule 
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applications 


{{O)(*o)}} - Si - cS 2 


The main reasons for this lemma to hold are the following. 6 


Lemma 2.23 Let A be an ABox contained in Si for some i > 1. 

• For every individual x A x o occurring in A, there is a unique sequence R\,. . . ,Ri 
(l > 1) of role names and a unique sequence xi, . . . , xi-\ of individual names 
such that {i?i(;ro, xi), RAxi, xf ), . . . , Rfxi-i,x)} C A. In this case, we say that 
x occurs on level l in A. 

• If C(x) £ A for an individual name x on level l, then the maximal role depth of 
C (i.e., the maximal nesting of constructors involving roles) is bounded by the 
maximal role depth of Cq minus l. Consequently, the level of any individual in A 
is bounded by the maximal role depth of Cq. 

• If C(x) £ A, then C is a subdescription of Cq- Consequently, the number of dif- 
ferent concept assertions on x is bounded by the size of Cq. 

• The number of different role successors of x in A (i.e., individuals y such that 
R(x,y) £ A for a role name R) is bounded by the sum of the numbers occurring 
in at-least. restrictions in Cq plus the number of different existential restrictions 
in Cq. 


Starting with {{Co(xo)}}, we thus obtain after a finite number of rule 
applications a set of ABoxes S to which no more rules apply. An ABox A 
is called complete iff none of the transformation rules applies to it. Consis- 
tency of a set of complete ABoxes can be decided by looking for obvious 
contradictions, called clashes. The ABox A contains a clash iff one of the 
following three situations occurs: 

1. {_L(x)} C A for some individual name x; 

2. {A(x), ~^A(x)} C A for some individual name x and some concept name A; 

3. {(<ni?)(x)} U {R(x,yi) | 1 < i < n + 1} U {y t ft yj | 1 < i < j < n + 1} C A 
for individual names x, yi , . . . , y n +i, a nonnegative integer n, and a role name 
R. 

Obviously, an ABox that contains a clash cannot be consistent. Hence, if 
all the ABoxes in S contain a clash, then S is inconsistent, and thus by 
the soundness lemma {C'o(xo)} is inconsistent as well. Consequently, Cq is 
unsatisfiable. If, however, one of the complete ABoxes in S is clash-free, 

6 A detailed proof of termination for a set of rules extending the one of Figure 2.6 can be found 
in [Baader and Sattler, 1999] . A termination proof for a slightly different set of rules has been 
given in [Donini et al., 1997a]. 
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then S is consistent. By soundness of the rules, this implies consistency of 
{Co(xo)}, and thus satisfiability of Cq. 

Lemma 2.24 (Completeness) Any complete and clash-free ABox A has 
a model. 

This lemma can be proved by defining the canonical interpretation X 4 
induced by A: 

1 . the domain A 1 - 4 of X 4 consists of all the individual names occurring in A; 

2. for all atomic concepts A we define A Xa = {x \ A(x ) G .4}; 

3. for all atomic roles R we define R Xa = {( x,y ) j R(x,y) G A}. 

By definition, T 4 satisfies all the role assertions in A. By induction on the 
structure of concept descriptions, it is easy to show that it satisfies the 
concept assertions as well. The inequality assertions are satisfied since x 
y G A only if x, y are different individual names. 

The facts stated in Lemma 2.23 imply that the canonical interpretation 
has the shape of a finite tree whose depth is linearly bounded by the size 
of Cq and whose branching factor is bounded by the sum of the numbers 
occurring in at-least restrictions in Co plus the number of different existential 
restrictions in Co- Consequently, ACCM has the finite tree model property , 
i.e. , any satisfiable concept Co is satisfiable in a finite interpretation X that 
has the shape of a tree whose root belongs to Co- 

To sum up, we have seen that the transformation rules of Figure 2.6 
reduce satisfiability of an ACCM -concept Co (in negation normal form) to 
consistency of a finite set S of complete ABoxes. In addition, consistency of 
S can be decided by looking for obvious contradictions (clashes). 

Theorem 2.25 It is decidable whether or not an ACC M -concept is satis- 
fiable. 


2. 3. 2. 2 Complexity issues 

The tableau-based satisfiability algorithm for ACCM presented above 
may need exponential time and space. In fact, the size of the canonical 
interpretation built by the algorithm may be exponential in the size of the 
concept description. For example, consider the descriptions C n (n > 1), 
which are inductively defined as follows: 

Ci = 3R.A n 3R.B 
C n+ i = 3R.A n 3R.B n MR.C n . 
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Obviously, the size of C n grows linearly in n. However, given the input 
description C n , the satisfiability algorithm introduced above generates a 
complete and clash-free ABox whose canonical model is the full binary tree 
of depth n, and thus consists of 2” +1 — 1 individuals. 

Nevertheless, the satisfiability algorithm can be modified such that it 
needs only polynomial space. The main reason is that different branches of 
the tree model to be generated by the algorithm can be investigated sepa- 
rately. Since the complexity class NPSpace coincides with PSpace [Savitch, 
1970], it is sufficient to describe a non-deterministic algorithm using only 
polynomial space, i.e., for every non-deterministic rule we may simply 
assume that the algorithm chooses the correct alternative. In principle, the 
modified algorithm works as follows: it starts with {C'o(xo)} and: 

1. applies the — > n - and —►□-rules as long as possible, and checks for clashes of 
the form A(x o), -^A(xo) and _L(xo); 

2. generates all the necessary direct successors of xo using the — >a- and the — ►>- 
rule; 

3. generates the necessary identifications of these direct successors using the — ><- 
rule, and checks for clashes caused by at-most restrictions; 

4. successively handles the successors in the same way. 

Since after identification the remaining successors can be treated separately, 
the algorithm needs to store only one path of the tree model to be generated, 
together with the direct successors of the individuals on this path and the 
information which of these successors must be investigated next. We already 
know that the length of the path is linear in the size of the input description 
Cq. Thus, the only remaining obstacle on our way to a PSPACE-algorithm 
is the fact that the number of direct successors of an individual on the path 
also depends on the numbers in the at-least restrictions. If we assumed these 
numbers to be written in base 1 representation (where the size of the repre- 
sentation coincides with the number represented), this would not be a prob- 
lem. However, for bases larger than 1 (e.g., numbers in decimal notation), 
the number represented may be exponential in the size of the representation. 
For example, the representation of 10” — 1 requires only n digits in base 10 
representation. Thus, we cannot introduce all the successors required by at- 
least restrictions while only using polynomial space in the size of the concept 
description if the numbers in this description are written in decimal notation. 

It turns out, however, that most of the successors required by the at- 
least restrictions need not be introduced at all. If an individual x obtains 
at least one i?-successor due to the application of the — ►g-rule, then the 
— >>-rule need not be applied to x for the role R. Otherwise, we simply 
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introduce one R- successor as representative. In order to detect inconsisten- 
cies due to conflicting number restrictions, we need to add a new type of 
clash : {(^nR)(x),(^mR)(x)} C A for nonnegative integers n < m. The 
canonical interpretation obtained by this modified algorithm need not sat- 
isfy the at-least restrictions in Cq. However, it can easily be modified to an 
interpretation that does, by duplicating i?-successors (more precisely, the 
whole subtrees starting at these successors). 

Theorem 2.26 Satisfiability of ACC Af -concept descriptions is PSpace- 
complete. 

The above argument shows that the problem is in PSpace. The hardness 
result follows from the fact that the satisfiability problem is already PSpace- 
hard for the sublanguage ACC, which can be shown by a reduction from 
validity of Quantified Boolean Formulae [Schmidt-Schaufi and Srnolka, 1991]. 
Since subsumption and satisfiability of ACC Af -concept descriptions can be 
reduced to each other in linear time, this also shows that subsumption of 
ACCAI -concept descriptions is PS PACE-complete. 

2. 3. 2. 3 Extension to the consistency problem for ABoxes 

The tableau-based satisfiability algorithm described in Subsection 2. 3. 2.1 
can easily be extended to an algorithm that decides consistency of ACCAI - 
ABoxes. Let A be an ACCAf-ABox such that (w.l.o.g.) all concept descrip- 
tions in A are in negation normal form. To test A for consistency, we first 
add inequality assertions a fib for every pair of distinct individual names 
a, b occurring in A. 7 Let Aq be the ABox obtained this way. The consistency 
algorithm applies the rules of Figure 2.6 to the singleton set {-4o}- 

Soundness and completeness of the rule set can be shown as before. Un- 
fortunately, the algorithm need not terminate, unless one imposes a specific 
strategy on the order of rule applications. For example, consider the ABox 

A 0 = { R{a , a), (3R.A)(a), « 1 R){a), (' VR3R.A)(a )}. 

By applying the — ►g-rule to a, we can introduce a new i?-successor x of a: 

A\ = Ao U {R(a, x), A(x)}. 

The — >v-rule adds the assertion (3 R.A)(x), which triggers an application of 
the — tg-rule to x. Thus, we obtain the new ABox 

A 2 = AiU {(3 R.A)(x), R(x,y), A{y)}. 


7 This takes care of the UNA. 
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Since a has two ^-successors in A 2 , the — ><-rule is applicable to a. By- 
replacing every occurrence of x by a, we obtain the ABox 

A 3 = A 0 u {A(a), R(a, y), A(y)}. 

Except for the individual names (and the assertion A(a), which is, however, 
irrelevant), ^3 is identical to A\ . For this reason, we can continue as above 
to obtain an infinite chain of rule applications. 

We can easily regain termination by requiring that generating rules (i.e. , 
the rules —>3 and — >>) may only be applied if none of the other rules is 
applicable. In the above example, this strategy would prevent the application 
of the — > 3 -rule to x in the ABox A\ U {(3i?.A)(x)} since the — ><-rule is also 
applicable. After applying the — ><-rule (which replaces a: by a), the — >g-rule 
is no longer applicable since a already has an i?-successor that belongs to 
A. 

Using a similar idea, one can reduce the consistency problem for ACCAf- 
ABoxes to satisfiability of ACCAf -concept descriptions [Hollunder, 1996]. In 
principle, this reduction works as follows. In a preprocessing step, one applies 
the transformation rules only to old individuals (i.e., individuals present in 
the original ABox). Subsequently, one can forget about the role assertions, 
i.e., for each individual name in the preprocessed ABox, the satisfiability 
algorithm is applied to the conjunction of its concept assertions (see [Hol- 
lunder, 1996] for details). 

Theorem 2.27 Consistency of ACCM -ABoxes is PSpace -complete. 

2.3. 2-4 Extension to general inclusion axioms 

In the above subsections, we have considered the satisfiability problem for 
concept descriptions and the consistency problem for ABoxes without an 
underlying TBox. In fact, for acyclic TBoxes one can simply expand the 
definitions (see Subsection 2.2.4). Expansion is, however, no longer possible 
if one allows general inclusion axioms of the form CCD, where C and 
D may be complex descriptions. Instead of considering finitely many such 
axioms C\ CDi,..., C n C D n , it is sufficient to consider the single axiom 
TCC, where 

c = (-Ci u Eh) n • • • n (-c n u D n ). 

The axiom TCC simply says that any individual must belong to the con- 
cept C. The tableau algorithm introduced above can easily be modified 
such that it takes this axiom into account: all individuals (both the origi- 
nal individuals and the ones newly generated by the — >g-rule and the — »>- 
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rule) are simply asserted to belong to C. However, this modification may 
obviously lead to nontermination of the algorithm. For example, consider 
what happens if this algorithm is applied to test consistency of the ABox 
Aq = {A(xo), (3i?.A)(xo)} w.r.t. the axiom T C 3R.A: the algorithm gen- 
erates an infinite sequence of ABoxes Ai,A 2 ,--- and individuals aq, X 2 , ■ ■ ■ 
such that Ai+i = Ai U {R(xi, aq+i), A(aq+ 1 ), (3i?.A)(xj+i)}. Since all indi- 
viduals Xi receive the same concept assertions as xq, we may say that the 
algorithms has run into a cycle. 

Termination can be regained by trying to detect such cyclic computations, 
and then blocking the application of generating rules: the application of the 
rules — >3 and — »> to an individual x is blocked by an individual y in an 
ABox A iff {D | D(x) G .4} C {D' \ D'(y) G A}. The main idea underlying 
blocking is that the blocked individual x can use the role successors of y 
instead of generating new ones. For example, instead of generating a new R- 
successor for x\ in the above example, one can simply use the i?-successor 
of xq. This yields an interpretation X with A x = {xo,aq}, A 1 = A 1 , and 
R 1 = {(xo,a:i), (xi,aq)}. Obviously, X is a model of Ao and of the axiom 
T C 3R.A. 

To avoid cyclic blocking (of x by y and vice versa) , we consider an enumer- 
ation of all individual names, and define that an individual x may only be 
blocked by individuals y that occur before x in this enumeration. This, to- 
gether with some other technical assumptions, makes sure that an algorithm 
using this notion of blocking is sound and complete as well as terminating 
(see [Buchheit et al. , 1993a; Baader et ai, 1996] for details). Thus, con- 
sistency of ACCAf -ABoxes w.r.t. general inclusion axioms is decidable. It 
should be noted that the algorithm is no longer in PSpace since it may 
generate role paths of exponential length before blocking occurs. In fact, 
even for the language ACC, satisfiability w.r.t. a single general inclusion 
axiom is known to be ExpTiME-hard [Schild, 1994] (see also Chapter 3). 
The tableau-based algorithm sketched above is a NExpTime algorithm. 
However, using the translation technique mentioned at the beginning of 
this section, it can be shown [De Giacomo, 1995] that A£CAA ABoxes and 
general inclusion axioms can be translated into PDL, for which satisfiabil- 
ity can be decided in exponential time. An ExpTime tableau algorithm 
for ACC with general inclusion axioms was described by Donini and Mas- 
sacci [ 2000 ]. 


Theorem 2.28 Consistency of ACCJ\f -ABoxes w.r.t. general inclusion ax- 
ioms is ExpTime -complete. 
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2. 3. 2. 5 Extension to other language constructors 
The tableau-based approach to designing concept satisfiability and ABox 
consistency algorithms can also be employed for languages with other con- 
cept and/or role constructors. In principle, each new constructor requires a 
new rule, and this rule can usually be obtained by simply considering the 
semantics of the constructor. Soundness of such a rule is often very easy to 
show. More problematic are completeness and termination since they must 
also take interactions between different rules into account. As we have seen 
above, termination can sometimes only be obtained if the application of rules 
is restricted by an appropriate strategy. Of course, one may only impose such 
a strategy if one can show that it does not destroy completeness. 


2.3.3 Reasoning w.r.t. terminologies 

Recall that terminologies (TBoxes) are sets of concept definitions (i.e., equal- 
ities of the form A = C where A is atomic) such that every atomic concept 
occurs at most once as a left-hand side. We will first comment briefly on 
the complexity of reasoning w.r.t. acyclic terminologies, and then consider 
in more detail reasoning w.r.t. cyclic terminologies. 

2.3.3. 1 Acyclic terminologies 

As shown in Subsection 2.2.4, reasoning w.r.t. acyclic terminologies can be 
reduced to reasoning without terminologies by first expanding the TBox, 
and then replacing name symbols by their definitions in the terminology. 
Unfortunately, since the expanded TBox may be exponentially larger than 
the original one [Nebel, 1990b], this increases the complexity of reasoning. 
Nebel [1990b] also shows that this complexity can, in general, not be avoided: 
for the language EC o, subsumption between concept descriptions can be 
tested in polynomial time (see Subsection 2.3.1), whereas subsumption w.r.t. 
acyclic terminologies is coNP-complete (see also Subsection 2. 3. 3. 2 below). 

For more expressive languages, the presence of acyclic TBoxes may or may 
not increase the complexity of the subsumption problem. For example, sub- 
sumption of concept descriptions in the language ACC is PS PACE-complete, 
and so is subsumption w.r.t. acyclic terminologies [Lutz, 1999a]. Of course, in 
order to obtain a PSPACE-algorithm for subsumption in ACC w.r.t. acyclic 
TBoxes, one cannot first expand the TBox completely since this might need 
exponential space. The main idea is that one uses a tableau-based algorithm 
like the one described in Subsection 2.3.2, with the difference that it receives 
concept descriptions containing name symbols as input. Expansion is then 
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done on demand: if the tableau-based algorithm encounters an assertion of 
the form A{x), where A is a name occurring on the left-hand side of a defi- 
nition A = C in the TBox, then it adds the assertion C(x). However, it does 
not further expand C at this stage. It is not hard to show that this really 
yields a P Space- algorithm for satisfiability (and thus also for subsumption) 
of concepts w.r.t. acyclic TBoxes in ACC [Lutz, 1999a]. 

There are, however, extensions of ACC for which this technique no longer 
works. One such example is the language ACCiF, which extends ACC by 
functional roles as well as agreements and disagreements on chains of func- 
tional roles (see Section 2.4 below). Satisfiability of concepts is PSpace- 
complete for this language [Hollunder and Nutt, 1990], but satisfiability of 
concepts w.r.t. acyclic terminologies is NExpTlME-complete [Lutz, 1999a]. 

2. 3. 3. 2 Cyclic terminologies 

For cyclic terminologies, expansion is no longer possible since it would not 
terminate. If we use descriptive semantics, then cyclic terminologies are 
a special case of terminologies with general inclusion axioms. Thus, the 
tableau-based algorithm for handling general inclusion axioms introduced 
in Subsection 2. 3. 2. 4 can also be used for cyclic „4£CA7-TBoxes with de- 
scriptive semantics. For cyclic .4£C-TBoxes with fixpoint semantics, the 
connection between Description Logics and propositional modal logics turns 
out to be useful. In fact, syntactically monotone „4£C-TBoxes with least 
or greatest fixpoint semantics can be expressed within the propositional /jl- 
calculus, which is an extension of the propositional multi-modal logic K m 
by fixpoint operators (see [Schild, 1994; De Giacomo and Lenzerini, 1994b; 
1997] and Chapter 5 for details). Since reasoning w.r.t. general inclusion 
axioms in ACC and reasoning in the propositional /i-calculus are both Exp- 
TiME-complete, these reductions yield an ExpTime upper bound for rea- 
soning w.r.t. cyclic terminologies in sublanguages of ACC. 

For less expressive Description Logics, more efficient algorithms can, how- 
ever, be obtained with the help of techniques based on finite automata. 
Following [Baader, 1996b], we will sketch these techniques for the small lan- 
guage TCq. The results can, however, be extended to the language ACM 
[Kiisters, 1998]. We will develop the results for FCq in two steps, starting 
with an alternative characterization of subsumption between .F£o- conce Pt 
descriptions, and then extending this characterization to cyclic TBoxes 
with greatest fixpoint semantics. Baader [1996b] also considers cyclic FCq- 
TBoxes with descriptive and with least fixpoint semantics. For these seman- 
tics, the characterization of subsumption is more involved; in particular, 
the characterization of subsumption w.r.t. descriptive semantics depends on 
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finite automata working on infinite words, so-called Biichi automata. Acyclic 
TBoxes can be seen as a special case of cyclic TBoxes, where all three types 
of semantics coincide. 

In Subsection 2.3.1, the equivalence n (VR.D) = VR.(C n D) was 

used as a rewrite rule from left to right in order to compute the structural 
subsumption normal form of .TbCo-concept descriptions. If we use this rule 
in the opposite direction, we obtain a different normal form, which we call 
concept- centered normal form since it groups the concept description w.r.t. 
concept names (and not w.r.t. role names, as the structural subsumption 
normal form does). Using this rule, any .TTCo-concept description can be 
transformed into an equivalent description that is a conjunction of descrip- 
tions of the form Vi?i . • • • \/R m .A for m > 0 (not necessarily distinct) role 
names R\, , R m and a concept name A. We abbreviate VRi . • • • \/R m .A by 
Vi?i • • • R m .A, where R\ ■ ■ ■ R m is viewed as a word over the alphabet S of 
all role names. In addition, instead of Vuq.A n • • • n Vug. A we write VL.A 
where L = {uq, . . . , wi} is a finite set of words over E. The term V0.A is con- 
sidered to be equivalent to the top concept T, which means that it can be 
added to a conjunction without changing the meaning of the concept. Using 
these abbreviations, any pair of .U£o-concept descriptions C, D containing 
the concept names A\, , A^ can be rewritten as 

C = MJJ 1 .Ai n • • • n\/U k .A k and D = VVi.Ai n • • • n VV fc .A fc , 

where Ui, V are finite sets of words over the alphabet of all role names. This 
normal form provides us with the following characterization of subsumption 
of J r £o-concept descriptions [Baader and Narendran, 1998]: 

CCD iff Ui DVi for all i, 1 < i < k. 

Since the size of the concept-based normal forms is polynomial in the size 
of the original descriptions, and since the inclusion tests U t O Vj can also 
be realized in polynomial time, this yields a polynomial-time decision pro- 
cedure for subsumption in TLq. In fact, as shown in [Baader et al, 1998a], 
the structural subsumption algorithm for J-Cq can be seen as a special im- 
plementation of these inclusion tests. 

This characterization of subsumption via inclusion of finite sets of words 
can be extended to cyclic TBoxes with greatest fixpoint semantics as follows. 
A given TBox T can be translated into a finite automaton 8 At whose states 
are the concept names occurring in T and whose transitions are induced by 


Strictly speaking, we obtain a finite automaton with word transitions, i.e., transitions that may 
be labeled by a word over E rather than a letter of E. 



98 


F. Baader and W. Nutt 


A 

B 

C 


MR.A n MS.C 
VR.VS.C 
P n VS.C 


Fig. 2.7. A TBox and the corresponding automaton. 

the value restrictions occurring in T (see Figure 2.7 for an example and 
[Baader, 1996b] for the formal definition). 

For a name symbol A and a base symbol P in T, the language L. 4 T (A, P) 
is the set of all words labeling paths in At from A to P. The languages 
L At (A,P) represent all the value restrictions that must be satisfied by in- 
stances of the concept A. With this intuition in mind, the following char- 
acterization of subsumption w.r.t. cyclic PC^-TBoxes with greatest fixpoint 
semantics should not be surprising: 

A \P-t B iff L At (A, P) D L At (B, P) for all base symbols P. 

In the example of Fig. 2.7, we have L At (A,P) = R*SS* D RSS* = 
L At (B, P ), and thus A C-j B, but not B C 7 - A. 

Obviously, the languages L At (A,P) are regular, and any regular lan- 
guage can be obtained as such a language. Since inclusion of regular lan- 
guages is a PSPACE-complete problem [Garey and Johnson, 1979], this 
shows that subsumption w.r.t. cyclic XXo-TBoxes with greatest fixpoint 
semantics is PSPACE-complete [Baader, 1996b]. For an acyclic terminol- 
ogy T , the automaton At is acyclic as well. Since inclusion of lan- 
guages accepted by acyclic finite automata is coNP-complete, this proves 
Nebel’s result that subsumption w.r.t. acyclic XXo-TBoxes is coNP-complete 
[Nebel, 1990b]. 


2.4 Language extensions 

In Section 2.2 we have introduced the language ACC AT as a prototypical 
Description Logic. For many applications, the expressive power of ACCAf 
is not sufficient. For this reason, various other language constructors have 
been introduced in the literature and are employed by systems. Roughly, 
these language extensions can be put into two categories, which (for lack 
of a better name) we will call “classical” and “nonclassical” extensions. In- 
tuitively, a classical extension is one whose semantics can easily be defined 
within the model-theoretic framework introduced in Section 2.2, whereas 
defining the semantics of a nonclassical constructor is more problematic 
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and requires an extension of the model-theoretic framework (such as the 
semantics of the epistemic operator K introduced in Subsection 2.2.5). In 
this section, we briefly introduce the most important classical extensions 
of Description Logics. Inference procedures for such expressive Description 
Logics are discussed in Chapter 5. Nonclassical extensions are the subject of 
Chapter 6. 

In addition to constructors that can be used to build complex roles, we 
will introduce more expressive number restrictions, and constructors that 
allow one to express relationships between the role-filler sets of different 
(complex) roles. 


2.4-1 Role constructors 

Since roles are interpreted as binary relations, it is quite natural to employ 
the usual operations on binary relations (such as Boolean operators, compo- 
sition, inverse, and transitive closure) as role-forming constructors. Syntax 
and semantics of these constructors can be defined as follows: 

Definition 2.29 (Role constructors) Every role name is a role descrip- 
tion (atomic role), and if R, S are role descriptions, then RnS (intersec- 
tion), RU S (union), -i R (complement), Ro S (composition), R + (transitive 
closure), R (inverse) are also role descriptions. 

A given interpretation X is extended to (complex) role descriptions as 
follows: 

1. (R n S) x = R 1 n S x , (R U S) 1 = R 1 U S x , (^R) x = A 1 x A 1 \ R x - 

2. (R o S) x = {(a, c) G A x x A x | 3b. (a, 6) G R x A (6, c) G S x }\ 

3. (i? + ) x = U ! :>i(^ X )b he., {R + ) x is the transitive closure of {R x ); 

4. ( R-) x = {(b,a) G A x x A x | (a, b) G R 1 }. 

For example, the union of the roles hasSon and hasDaughter can be used 
to define the role hasChild, and the transitive closure of hasChild expresses 
the role hasDescendants. The inverse of hasChild yields the role hasParent. 

The complexity of satisfiability and subsumption of concepts in the lan- 
guage ACCAf n (also called AjCCMIZ in the literature), which extends ACCAf 
by intersection of roles, has been investigated in [Donini et al, 1997a]. 
It is shown that these problems are still PSPACE-complete, provided that 
the numbers occurring in number restrictions are written in base 1 repre- 
sentation (where the size of the representation coincides with the number 
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represented). Tobies [2001b] shows that this result also holds for non-unary 
coding of numbers. Decidability of the extension of ACCM by the three 
Boolean operators and the inverse operator is an immediate consequence 
of the fact that concepts of the extended language can be expressed in 
C 2 , i.e. , first-order predicate logic with two variables and counting quan- 
tifiers, which is known to be decidable in NExpTime [Gradel et al, 1997b; 
Pacholski et al . , 1997]. Lutz and Sattler [2000a] show that ACC extended 
by role complement is ExpTlME-complete, whereas ACC extended by role 
intersection and (atomic) role complement is NExpTiME-complete. 

In [Baader, 1991], the Description Logic ACCtrans, which extends ACC by 
transitive closure, composition, and union of roles, has been introduced, 
and subsumption and satisfiability of ATCtrans-concepts has been shown 
to be decidable. Schild’s observation [Schild, 1991] that ACCtrans is just 
a syntactic variant of Propositional Dynamic Logic (pdl) [Fischer and 
Ladner, 1979] yields the exact complexity of subsumption and satisfiabil- 
ity in ACCtrans: they are ExpTlME-complete [Fischer and Ladner, 1979; 
Pratt, 1979; 1980]. The extension of ACC trans by the inverse constructor 
corresponds to converse PDL [Fischer and Ladner, 1979], which can also 
be shown to be decidable in deterministic exponential time [Vardi, 1985]. 
Whereas this extension of ACCtrans does not change the properties of the 
obtained Description Logic in a significant way, things become more complex 
if both number restrictions and the inverse of roles are added to ACCtrans. 
Whereas ACCtrans and ACCtrans with inverse still have the finite model prop- 
erty, ACCtrans extended by inverse and number restrictions does not. Indeed, 
it is easy to see that the concept 

-i A n 3RT.A n « l R) n \/(R~) + .(3R~.A n l R)) 


is satisfiable in an infinite interpretation, but not in a finite one. Neverthe- 
less, this Description Logic still has an ExpTlME-complete subsumption and 
satisfiability problem. In fact, in [De Giacomo, 1995], number restrictions, 
the inverse of roles, and Boolean operators on roles are added to ACCtrans, 
and ExpTiME-decidability is shown by a rather ingenious reduction to the 
decision problem for ACCtrans. It should be noted, however, that in this work 
only atomic roles and their inverse may occur in number restrictions, and 
that the complement of roles is built with respect to a fixed role any, which 
must contain all other roles, but need not be interpreted as the universal role 
(i.e., A x x A x ). As we shall see below, allowing more complex roles inside 
number restrictions may easily cause undecidability. 
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2.4-2 Expressive number restrictions 

There are three different ways in which the expressive power of number 
restrictions can be enhanced. 

First, one can consider so-called qualified number restrictions, where the 
number restrictions are concerned with role fillers belonging to a certain 
concept. For example, given the role hasChild, the simple number restrictions 
introduced above can only state that the number of all children is within 
certain limits, such as in the concept hasChild n ^5 hasChild. Qualified 
number restrictions can also express that there are at least 2 sons and at 
most 5 daughters: 


^ 2 hasChild. Male n ^ 5 hasChild. Female. 

Adding qualified number restrictions to ACC leaves the important inference 
problems (like subsumption and satisfiability of concepts, and consistency 
of ABoxes) decidable: the worst-case complexity is still PS PACE-complete. 
Membership in PS PACE was first shown for the case where numbers occur- 
ring in number restrictions are written in base 1 representation [Hollunder 
and Baader, 1991a; Hollunder, 1996]. More recently, this has been proved 
even for the case of binary (or, equivalently, decimal) representation of num- 
bers [Tobies, 1999c; 2001b]. The language stays decidable if general sets of 
inclusion axioms are allowed [Buchheit et al, 1993a]. 

Second, one can allow complex role expressions inside number restrictions. 
As already mentioned above, allowing the three Boolean operators and the 
inverse operator in number restrictions of ACC AT leaves us within C 2 , which 
is known to be decidable. In [Baader and Sattler, 1996b; 1999], languages 
that allow composition of roles in number restrictions have been considered. 9 
The extension of ACC by number restrictions involving composition has a de- 
cidable satisfiability and subsumption problem. On the other hand, if either 
number restrictions involving composition, union and inverse, or number 
restrictions involving composition and intersection are added, then satisfi- 
ability and subsumption become undecidable [Baader and Sattler, 1996b; 
1999]. For ACCtrans, the extension by number restrictions involving compo- 
sition is already undecidable [Baader and Sattler, 1999]. 

Third, one can replace the explicit numbers n in number restrictions by 
variables a that stand for arbitrary nonnegative integers [Baader and Sat- 
tler, 1996a; 1999]. This allows one, for example, to define the concept of all 
persons having at least as many daughters as sons, without explicitly saying 


Note that composition cannot be expressed within C 2 . 
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how many sons and daughters the person has: 

Person n ^ a hasDaughter n ^ a hasSon. 

The expressive power of this language can further be increased by intro- 
ducing explicit quantification of the numeric variables. For example, it is 
important to know whether the numeric variables are introduced before or 
after a value restriction. This is illustrated by the following concept 

Person n ja.(VhasChild.(^ a hasChild n ^ a hasChild)), 

in which introducing the numerical variable before the universal value re- 
striction makes sure that all the children of the person have the same number 
of children. Here, ja stands for an existential quantification of a. Univer- 
sal quantification of numerical variables comes in via negation. In [Baader 
and Sattler, 1996a; 1999] it is shown that ALCM extended by such symbolic 
number restrictions with universal and existential quantification of numeri- 
cal variables has an undecidable satisfiability and subsumption problem. If 
one restricts this language to existential quantification of numerical variables 
and negation on atomic concepts, then satisfiability becomes decidable, but 
subsumption remains undecidable. 

2.4-3 Role-value-maps 

Role- value-maps are a family of very expressive concept constructors, which 
were, however, available in the original Kl-One system. They allow one to 
relate the sets of role fillers of role chains. 

Definition 2.30 (Role-value-maps) A role chain is a composition R.\ o 
• • • o R n of role names. If R, S are role chains, then R C S and R = S are 
concepts (role- value-maps). The former is called a containment role- value- 
map, while the latter is called an equality role- value-map. 

A given interpretation 1 is extended to role-value-maps as follows: 

1. (R C S) x = {a G A 1 | V6. (a, b) £ R 1 —> (a, b ) G S x }, 

2. (R = S) x = {«eA I V6. (a, b) G R x <-> (a, b) G S 1 }. 

For example, the concept 

Person n (hasChild o hasFriend C knows) 
describes the persons knowing all the friends of their children, and 
Person n (marriedTo o likesToEat = likesToEat) 
describes persons having the same favorite foods as their spouse. 
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Unfortunately, in the presence of role- value-maps, the subsumption prob- 
lem is undecidable, even if the language allows only conjunction and value 
restriction as additional constructors [Schmid t-Schaufi, 1989] (see also Chap- 
ter 3). 

To avoid this problem, one may restrict attention to role chains of func- 
tional roles, also called attributes or features in the literature. An interpre- 
tation 1 interprets the role R as a functional role iff {(a, b), (a, c)} C R 1 
implies b = c. In the following, we assume that the set of role names is par- 
titioned into the set of functional roles and the set of ordinary roles. Any 
interpretation must interpret the functional roles as such. Usually, we write 
functional roles with small letters /, g, possibly with index. 

Definition 2.31 (Agreements) If /, g are role chains of functional roles, 
then f = g and f A 9 are concepts (agreement and disagreement). 

A given interpretation I is extended to agreements and disagreements as 
follows: 

1- (/ = g) 1 = {a G A 1 I 3b. (a, b) G f 1 A (a, b) G g 1 }, 

2 - (/ # g) 1 = {a G A 1 | 3bi, b 2 - &i ^ b 2 A (a, 6i) G f x A (a, b 2 ) G g 1 }. 

In the literature, the agreement constructor is sometimes also called the 
same-as constructor. Note that, since /, g are role chains between functional 
roles, there can be at most one role filler for a w.r.t. the respective role chain. 
Also note that the semantics of agreements and disagreements requires these 
role fillers to exist (and be equal or distinct) for a to belong to the concept. 

For example, hasMother, hasFather, and hasLastName with their usual in- 
terpretation are functional roles, whereas hasParent and hasChild are not. 
The concept 

Person n (hasLastName = hasMother o hasLastName) 
n (hasLastName A hasFather o hasLastName) 

describes persons whose last name coincides with the last name of their 
mother, but not with the last name of their father. 

The restriction to functional roles makes reasoning in ACC extended 
by agreements and disagreements decidable [Hollunder and Nutt, 1990]. A 
structural subsumption algorithm for the language provided by the Classic 
system, which includes the same-as constructor, can be found in [Borgida 
and Patel-Schneider, 1994] . However, if general inclusion axioms (or transi- 
tive closure of functional roles or cyclic definitions) are allowed, then agree- 
ments and disagreements between chains of functional roles again cause 
subsumption to become undecidable [Nebel, 1991; Baader et al, 1993]. 
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Additional types of role interaction constructors similar to agreements and 
role-value-maps are investigated in [Hanschke, 1992]. 
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Abstract 

We present lower bounds on the computational complexity of satisfiability 
and subsumption in several Description Logics. We interpret these lower 
bounds as coming from different “sources of complexity”, which we isolate 
one by one. We consider both reasoning with simple concept expressions and 
reasoning with an underlying TBox. We discuss also complexity of instance 
checking in simple ABoxes. We have tried to enhance clarity and ease of 
presentation, sometimes sacrificing exhaustiveness for lack of space. 


3.1 Introduction 

Complexity of reasoning has been one of the major issues in the development 
of Description Logics. This is because such logics are conceived [Brachman 
and Levesque, 1984] as the formal specification of subsystems for represent- 
ing knowledge, to be used in larger knowledge-based systems. Since using 
knowledge also means deriving implicit facts from the given ones, the imple- 
mentation of derivation procedures should take into account the optimality 
of reasoning algorithms. The study of optimal algorithms starts from the 
elicitation of the computational complexity of the problem the algorithm 
should solve. Initially, studies about the complexity of reasoning problems in 
Description Logics were more focused on polynomial-time versus intractable 
(np- or coNP-hard) problems. The idea was that a knowledge representation 
system based on a Description Logic with polynomial-time inference prob- 
lems would guarantee timely answers to the rest of the system. However, once 
systems based on very expressive Description Logics with exponential-time 
reasoning problems were implemented [Horrocks, 1998b], it was recognized 
that knowledge bases of realistic size could be processed in reasonable time. 


105 



106 


F. M. Donini 


This shifted most of the complexity analysis to Description Logics whose 
reasoning problems are ExpTiME-hard, or worse. 

This chapter presents some lower bounds on the complexity of basic rea- 
soning tasks in simple Description Logics. The reasoning services taken into 
account are: first, satisfiability and subsumption of concept expressions alone 
(no TBox), then the same reasoning services considering a TBox also, and 
in the last part of the chapter, instance checking w.r.t. an ABox. 

We show in detail some reductions from problems that are hard for com- 
plexity classes NP, coNP, PSpace, ExpTime, and from semidecidable prob- 
lems to satisfiability/subsumption in various Description Logics. Then, we 
show how these reductions can be adapted to other Description Logics as 
well. 

In several reductions, we use tableau expansions to prove the correctness of 
the reduction. Thus, a secondary aim in this chapter is to show how tableaux 
are useful not only in devising reasoning algorithms and complexity upper 
bounds - as seen in Chapter 2 - but also in finding complexity lower bounds. 
This is because tableaux untangle two different aspects of the computational 
complexity of reasoning in Description Logics: 

• The first aspect is the structure of possible models of a concept. Such a structure 
is - in many Description Logics - a tree of individual names, linked by arcs labeled 
by roles. We consider such a tree an AND-tree, in the sense that all branches must 
be followed to obtain a candidate model. Following [Schmidt-Schaufi and Smolka, 
1991], we call each branch of such a tree a trace. Readers familiar with tableaux 
terminology should observe that traces are not tableau branches; in fact, they 
form a structure inside a single tableau branch. 

• The second aspect is the structure of proofs or refutations. Clearly, if a trace 
contains an inconsistency - a clash in the terminology set up in Chapter 2 - 
the candidate models containing this trace can be discarded. When all candidate 
models are discarded this way, we obtain a proof of subsumption, or unsatisfia- 
bility. Hence, the structure of refutations is often best viewed as an OR-tree of 
traces containing clashes. 

Here we have chosen to mark the nodes with AND, OR when considering a 
satisfiability problem; if either unsatisfiability or subsumption is considered, 
AND and OR labels should be exchanged. Before starting with the various 
results, we elaborate on this subject in the next subsection. 


3.1.1 Intuition: sources of complexity 

The deterministic version of the calculus for ACCM in Chapter 2 can be 
seen as exploring an AND-OR tree, where an AND-branching corresponds 
to the (independent) check of all successors of an individual, while an 
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OR-branching corresponds to the different choices of application of a non- 
deterministic rule. 

Realizing that, one can see that the exponential-time behavior of the 
calculus is due to two independent origins: the AND-branching, responsible 
for the exponential size of a single candidate model, and the OR-branching, 
responsible for the exponential number of different candidate models. We 
call these two different combinatorial explosions sources of complexity. 

3. 1.1.1 OR-branching 

The OR-branching is due to the presence of disjunctive constructors, which 
make a concept satisfiable by more than one model. The obvious dis- 
junctive constructor is U; hence ACU is a good sublanguage to see this 
source of complexity. Recall that ACU allows one to form concepts us- 
ing negation of concept names, conjunction n, disjunction U, universal 
role quantification VR.C, and unqualified existential role quantification 
3R. This source of complexity is the same that makes propositional sat- 
isfiability NP-hard: in fact, satisfiability in ACU can be trivially proved 
NP-hard by rewriting propositional letters as atomic concepts, A as n, 
and V as U. Many proofs of coNP-hardness of subsumption were found 
by exploiting this source of complexity ([Levesque and Brachman, 1987; 
Nebel, 1988]), by reducing an NP-hard problem to non-subsumption. In Sub- 
section 3.2.1, we show how disjunction can also be introduced by combining 
role restrictions and universal quantification, and in Subsection 3.2.2 by 
combining number restrictions and role intersection. 

3. 1.1.2 AND-branching 

The AND-branching is more subtle. Its exponential behaviour is due to the 
interplay of qualified existential and universal quantifiers; hence ACS is now 
a minimal sublanguage of ACCM with these features. As mentioned in Chap- 
ter 2 one can see the effects of this source of complexity by expanding the 
tableau {D(x)}, when D is the following concept (whose pattern appears in 
many papers, from [Schmidt-Schaufi and Srnolka, 1991], to [Hemaspaandra, 
1999]) - see Chapter 2 for its general form: 


3P1NP2.yP3.Cn n 
3Pi.VP2.VP3.C12 n 
vPi.( 3 P 2 .vp 3 .c 21 n 
3 P 2 .VP 3 .C 22 n 
VP 2 .(3P 3 .C 3 1 n 
3P 3 .C 32 )). 
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For each level l of nested quantifiers, we use a different role Pi (but us- 
ing the same role R would produce the same results). The structure of 
the tableau for {Z?(x)}, which is the candidate model for D, is a bi- 
nary tree of height 3: the nodes are the individual names, the arcs are 
given by the P^-successor relation, and the branches are the traces in the 
tableau. 

Each trace ends with an individual that belongs to C\ t , C^j , C^, for 
i,j, k £ {1,2}. Hence, a clash may be found independently in each trace, i.e. , 
in each branch of the tree. To verify that this structure is indeed a model, 
one has to check every AND-branch of it; and branches can be exponentially 
many in the nesting of quantifiers. 

This source of complexity causes an exponential number of possible refu- 
tations to be searched through (each refutation being a trace containing a 
clash) . 

This second source of complexity is not evident in propositional calculus, 
but a similar problem appears in predicate calculus - where the interplay 
of existential and universal quantifiers may lead to large models - and in 
Quantified Boolean Formulae. 


Remark 3.1 For Description Logics that are not closed under negation, a 
source of complexity could be absent in satisfiability while it might appear in 
subsumption. This is because C is subsumed by D iff C n — ■ D is unsatisfiable, 
where —>D could belong to a Description Logic which is more expressive than 
the DL of C and D. ■ 


3.1.2 Overview of the chapter 

We first present separately the effect of each source of complexity. In the 
next section, we discuss intractability results stemming from disjunction 
(OR-branching), which lead to coNP-hard lower bounds. We discuss both 
the case of plain logical disjunction (as the Description Logic PC), and 
the case of disjunction arising from alternative identification of individuals 
(AC£J\f). Then in Section 3.3 we present an np lower bound stemming from 
AND-branching, namely a Description Logic in which concepts have one 
candidate model of exponential size. 

A P Space lower bound combining the two sources of complexity is pre- 
sented in Section 3.4, and then in Section 3.5 we show how axioms can 
combine in a succinct way the sources of complexity, leading to ExpTime- 
hardness of satisfiability. 
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In Section 3.6 we examine one of the first undecidability results found for 
a Description Logic, using the powerful construct of role-value-maps - now 
recognized as very expressive, because of this result. 

Finally, we analyze intractability arising from reasoning with individuals 
in ABoxes (Section 3.7), and add a final discussion (Section 3.8) about the 
significance of these results - beyond the initial study of theoretical com- 
plexity of reasoning - also for benchmark testing of implemented procedures. 

Section 3.9, with a list of complexity results for satisfiability and sub- 
sumption, closes the chapter. 

3.2 OR-branching: finding a model 

When the number of candidate models is exponential in the size of the con- 
cepts involved, it is a combinatorial problem to find the right candidate 
model to check. In Description Logics, this may lead to NP-hardness of sat- 
isfiability, and coNP-hardness of subsumption. 

3.2.1 Intractability in T C 

Brachman and Levesque [1984] (see also [Levesque and Brachman, 1987]) 
were the first to point out that a slight increase in the expressiveness of a 
Description Logic may result in a drastic change in the complexity of rea- 
soning. They called this effect a “computational cliff” of structured knowl- 
edge representation languages. They considered the language FC, which 
admits concept conjunction, universal role quantification, unqualified exis- 
tential quantification, and role restriction. For readability, the syntax and 
semantics of FC are recalled in Table 3.1. 

Role restriction allows one to construct a subrole of a role R, i.e. , a 
role whose extension is a subset of the extension of R. For example, the 
role child] ma | e may be used for the “son-of” relation. Observe two prop- 
erties of role restriction, whose proofs easily follow from the semantics in 
Table 3.1: 

1. for every role i?, the role R|t is equivalent to R\ 

2. for every role R, and concepts A , C , D, the concept (V(i?|c)-A) n (V(i?|zj).A) is 
equivalent to V(-R|(c u .d)).A. 

The second property highlights that disjunction - although not explicitly 
present in the syntax of the language - arises from semantics. 

Brachman and Levesque defined also the language FC ~ , derived from F C 
by omitting role restriction. They first showed that for F , subsumption 
can be decided by a structural algorithm, with polynomial-time complexity, 
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Table 3.1. Syntax and semantics of the Description Logic DC. For DC , 

omit role restriction. 



concept expressions 

semantics 

concept name 

A 

C A 1 

concept intersection 

CnD 

c T nD x 

limited exist, quant. 

3R 

{x £ A 1 \3y.(x,y) £ R 1 } 

value restriction 

MR.C 

{x £ A 1 My. (x, y) £ R 1 —> y £ C x } 


role expressions 

semantics 

role name 

P 

C A 1 x A 1 

role restriction 

R\c {0 

r, y) £ A 1 x A 1 (x, y) £ R x A y £ C x } 


similar to the one shown in Chapter 2. Then they showed that subsumption 
in DC is coNP-hard, exhibiting the first “computational cliff” in Description 
Logics. 

Since the original proof of coNP-hardness is somewhat complex, we give 
here a simpler proof, found by Calvanese [1990] . The proof is based on the 
observation that if C\ U • • • U C n = T, then, given a role R and a concept A. 
we have 


(VCRlciM) n • • • n (V(-R|c„M) 

' v '- r I(CiU-uC „)- 4 

VR\t-A 


MR.A. 


(from 2) 

(3.1) 


(3.2) 

(from 1) 

(3.3) 


Moreover, observe that, for every role Q and every concept C, the disjunction 
3Q UMQ.C is equivalent to the concept T. Hence V(-R|3q).H n V(i?|vQ.C')-^ 
is equivalent to MR. A. These observations are the key to the reduction 
from tautology checking of propositional 3DNF formulae to subsumption 
in DC. 


Theorem 3.2 Subsumption in DC is coNP-hard. 

Proof Given an alphabet of propositional variables L = {pi, . . . ,pk}, de- 
fine a propositional formula F = G\M ■ ■ ■ V G n in 3DNF over L, where each 
disjunct Gi is made of three literals l\ A l f A if, and for every i 6 
{1 , ,n}, and j £ {1,2,3}, each literal Ij is either a variable p £ L, or its 
negation p. 
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Given a set of role names {R, P±, . . . , P^} (one role P, for each variable pf) 
and a concept name A. define the concept Cf = (V-Rlc 1 ! • A)n---n(VR\ Cn .A) 
where, for each i G {1, ... . . , n}, Ci is the conjunction of three concepts D- n 
Dj n D - , and each D{ is 


D i = \lp h ' A Vi - for j€ {1,2,3}, iG{l,...,n}. 
I 3P h , if Ij = Ph 

Then the claim follows from the following lemma. 

Lemma 3.3 F is a tautology if and only if Cf = V R.A . 


□ 


Proof The proof of the claim is straightforward; however, since it appears 
only in Calvanese’s Master thesis (in Italian), we present it here in full. 

Only if If I 7 is a tautology, then Ci U • • • U C n = T. This can be shown 
by contradiction: suppose C\ U • • • U C n is not equivalent to T. Then, there 
exists an interpretation I in which there is an element x 0 Cf, for every 
i G {1, . . . , n}. Since each Ci = Dj n Df n Df, it follows that for each i there 
is a j G {1,2, 3} such that Define a truth assignment r to L as 

follows. For each h G {1, . . . , k}: 

• r{ph) = false iff l j = ph , and x fL D\ 

• r{p h ) = true iff l{ = ph, and x ^ Dj. 

Observe that we cannot have both r(pi f) = false and r(pi f) = true at the 
same time, since this would imply both x 0 3Ph and x 0 VPh.A, which is 

impossible since 3Ph U MPp.A = T. Evidently, t assigns false to at least 

one literal for each disjunct of F, contradicting the hypothesis that F is a 
tautology. Therefore C\ U ■ ■ • U C n = T. 

The claim is now implied by equivalences (3.1)— (3.3) . 

If Suppose F is not a tautology. Then, there exists a truth assignment 
t such that for each i e {l,...,n}, there exists a j € {1,2,3} such that 
r(lj) = false. 

Define an interpretation (A 1 , - x ), with A x containing three elements 
x, y, z, such that P ^ = (y, z ) if r(ph) = false, and P = 0 otherwise. More- 
over, let A 1 = 0, and R 1 = {x,y}. 

Observe that in this way, y € ( BPh) 1 iff 7 (p/i) = false, and y € (VPh-A) 1 
iff r(ph) = true. This implies that x fL ifJR.A) 1 . To prove the claim, we now 
show that x G Cp. 
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Observe that, for each i G {1, . . . , ?r}, there exists a j G {1,2, 3} such that 
r(lj) = false. For such a j, we show by case analysis that y 0 (Dj) 1 : 

• if = Ph then Dj = \/P h .A, and in this case, r{p h ) = false, hence y ^ ( \/Ph-A ) x ; 

• if Zf = P/j then D- = 3P/j, and in this case, r{p h ) = true, hence y ^ {3P h ) x . 

Therefore, for every i G {1, . . . , n} we have y 0 Cf. This implies that ( x , y) 0 
^Ifciu-uc )> hence x G (VP|(c; lU ...uc; n ).yl) :r , which is a concept equivalent to 

c F . 1 □ 

The above proof shows only that subsumption in T C is coNP-hard. 
However, role restrictions could also be used to obtain qualified existen- 
tial quantification, since 3R.C = 3R\c- Hence, 3FC contains also the AND- 
branching source of complexity. Combining the two sources of complexity, 
Donini et al. [1997a] proved a PSpace lower bound for subsumption in RC , 
matching the upper bound found by Schmidt-Schaufi and Srnolka [1991] . 


3.2.2 Intractability in RC plus qualified existential 
quantification and number restrictions 

As shown in Chapter 2, disjunction arises also from qualified existential 
quantification and number restrictions. This can be easily seen examining 
the construction of the tableau checking the satisfiability of the concept 

(3R.A) n (3R.(-iA n -.£)) n (3R.B) n ^ 2 R (3.4) 

in which, once three objects are introduced to satisfy the existentials, one 
has to choose between three non-equivalent identifications of pairs of objects, 
where only one identification leads to a consistent tableau branch. 

Remark 3.4 When a Description Logic includes number restrictions, then 
negation of concept names is included for free, at least from a computational 
viewpoint. In fact, a concept name A and its negation -i A can be coded as, 
say, ^ 4 Ra and ^ 3 Ra where Ra is a new role name introduced for A. 
Now these two concepts obey the same axioms as A and -iA - namely, 
their conjunction is _L and their union is T. Hence, everything we say about 
computational properties of Description Logics including RC~ plus number 
restrictions holds also for AC plus number restrictions. ■ 

We now present a proof of intractability based on this property. The reduc- 
tion was first published by Nebel [1988], who reduced the NP-complete prob- 
lem of set splitting [Garey and Johnson, 1979, p. 221] to non-subsumption 
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in the Description Logic of the Back system, which included the basic T C~ 
plus intersection of roles and number restrictions. SET splitting is the fol- 
lowing problem: 

Definition 3.5 (Set splitting) Given a collection C of subsets of a basic 
set S, decide if there exists a partition of S into two subsets Si and S 2 such 
that no subset of C is entirely contained in either Si or S 2 ■ 

We simplify the original reduction. We start from a variant of SET split- 
ting (still NP-complete) in which all c € C have exactly three elements, and 
reduce it to satisfiability in TC~ plus qualified existential role quantification 
and number restrictions. 1 Since role intersection can simulate qualified ex- 
istential role quantification (see next Subsection 3.2.2. 1) this result implies 
the original one. 

Theorem 3.6 Satisfiability in RC~£M is NP -hard. 

Proof Let S = {1, . . . , n}, and let ci, . . . , Ck be the subsets of S. There exists 
a splitting of S iff the concept Di n D 2 n D 3 is satisfiable, where D\ . D 2 , 
D 3 are defined as follows: 


D\ 

= 3R.B i n • • • n 3R.B n 

(3.5) 

d 2 

= \/R.(^2Qi n • • • n ^2Q fc ) 

(3.6) 

d 3 

= ^2 R 

(3.7) 


where each concept Bi codes which subsets element i appears in, as follows: 

Bj I \ j \ iec .3Q j .A i 

and concepts Ai, ... , A n are defined in such a way that they are pairwise 
disjoint - say, for * G {1, ... , 11 } let Ai = fiiR n ^ iR . Intuitively, when 
tableau rules dealing with n and qualified existential quantification are ap- 
plied to Di n D 2 n D 3 (x), one obtains a tableau whose tree structure of 
individual names can be visualized as in Figure 3.1. The rest of the proof 
strictly follows the original one [Nebel, 1988], hence we do not present it 
here. The intuition is that D 3 forces every ys generated by D\ to be iden- 
tified with one of only two successors of the root individual name x. Such 
identifications correspond to the sets ,S) and S^. Then D 2 forces the split of 
each 3-subset, since it makes sure that neither of these successors has more 

1 From Remark 3.4, this DL has the same computational properties of AC£J\f [Donini et al., 
1997a] 
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X 



l 

li 

Qji / 

j S '\Qjk 

/ Qj 2 I 

***\ 

Zij l z *3 

2 z iJk 


Fig. 3.1. The AND-tree structure of the tableau obtained by applying rules for n 
and 3R.C to D\ n Z? 2 n D 3 (x). Applying the rule for < 2 R(x) would lead to several 
OR-branclres (as many as the possible identifications of ys). 


than two <3 ./-successors, and thus both have at least one (5 ./-successor (since 
there are three of them). □ 


We clarify the construction and show its relevant properties by an 
example. 


Example 3.7 Suppose S' = {1, 2, 3, 4}, and let cp = {1,2, 4} , C 2 = {2,3,4}, 
C 3 = {1,3,4}. Applying the tableau rules of Chapter 2 to D i, one obtains 
the following tree of individual names (definitions of each Bi are expanded) : 


R{x,yi) 


R{x,y 2 ) 


Di(x) 


R(x, 2 / 3 ) 


R(x, 2/4) 


£1(2/1) 

£2(2/1) 

£ 3 ( 2 / 1 ) 

£4(2/1) 


f Qi(yi,zu) Ai(zu) 
1 <33(2/1 > ^13) ^-1(^13) 
f Ql( 2 / 2 ) ^21) A 2 ( z 2 i ) 
l <32(2/2, z 22 ) a 2 ( z 22 ) 
f <32(2/3,^32) 243(2:32) 
1 <33(2/3, £33) 243(2:33) 

{ < 3 1 ( 2/4 , £41) ^4(2:41) 
<32(2/4,^42) A 4 (z 42 ) 
<33(2/4,^43) ^ 4 ( 2 : 43 ) 


where the individual names yi, . . . , 2/4 stand for the four elements of S, and 
each Zij codes the fact that element i appears in subset Cj . Because of asser- 
tions Ai(zij), no two 2 :’s disagreeing on the first index - e.g., z 32 and z 42 ~ 
can be safely identified, since they must satisfy assertions on incompatible 
A’s. This is the same as if the constraints z t j / z^j, for all z, h E {1, . . . , |5|} 
with i yf h, and all j E {1, . . . , |C|}, were present. 

Now D 3 states that yi , . . . , y 4 must be identified into only two individual 
names. Observe that identifying 2 / 2 , 2 / 3 , 2/4 leads to an individual name (say, 
y 2 ) having among others, three unidentifiable ( 32 -fillers 2 : 22 ,^ 32 ,^ 42 - But D 2 
states that all R- fillers of x, including y 2 , have no more than 2 fillers for <32- 
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This rules out the identification of y 2,2/3, ija in the tableau. Observe that this 
identification corresponds to a partition of S into {1} and {2,3,4}, which is 
not a solution of SET splitting because the subset C 2 is not split. Following 
the same line of reasoning, one could prove that the only identifications of 
all ii-fillers into two individual names, leading to a satisfiable tableau, are 
one-to-one with solutions of set splitting. ■ 


The same reduction works for non-subsumption, since D\ n D2 n H3 
is satisfiable iff D\ n D 2 is not subsumed by -1.D3 = ^ 3 R. This type of 
reduction has also been applied (see [Donini et al, 1999]) to prove that 
subsumption in ACMT is coNP-hard, where ACMT is the Description Logic 
including AC, number restrictions and inverse roles. 

Observe that also T C~ CM contains the AND-branching source of com- 
plexity, since qualified existential restriction is present. With a more complex 
reduction from Quantified Boolean Formulae, combining the two sources of 
complexity, satisfiability and non-subsumption in ACCM has been proved 
PS PACE-complete by Hemaspaandra [1999]. 

Note that in the above proof of intractability, pairwise disjointness of 
Ai, . . . , A n could be also expressed by conjoining logn concept names and 
their negations in all possible ways. Hence, the proof needs only the concept 
^ 2 R, and when qualified existentials are simulated by subroles, only ^ 1 R 
is used. This shows that the above proof of intractability is quite sharp: 
intractability arises independently of the size of the numbers involved. The 
computational cliff is evident if one moves to having 0 and 1 only in number 
restrictions, which leads to so-called functional roles - since the assertion 
^ 1 R(x) forces R to be a partial function of x. In that case, the tractabil- 
ity of a Description Logic can usually be established, e.g., the Description 
Logic of the system Classic [Borgida and Patel-Schneider, 1994], The in- 
tuitive reason for tractability of functional roles can be found in the corre- 
sponding tableau rules, which for number restrictions of the form ^ 1 R(x) 
become deterministic: there is no choice in identifying individuals names 
yi,...,yk which are all i?-fillers for x, but to collapse them all into one 
individual. 


3.2.2. 1 Simulating 3R.C with role conjunction 

Donini et al. [1997a] showed that a concept D containing qualified existen- 
tial role quantifications 3R.C is satisfiable iff the concept D is satisfiable, 
where in D each occurrence of a concept 3R.C is replaced by the concept 
3(R fl Qc) n V(i? n Qc).C, adding Qc as a new role name (a different 
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Qc for each occurrence of 3R.C, to be used nowhere else). We call D an 
n -simulation of D in the rest of the chapter. 

The proof that the simulation is correct can be easily given by referring 
to tableaux. 

Example 3.8 Considering the concept D below on the left, and simulating 
qualified existential quantifications in D by role intersections, one obtains 
the concept D on the right, 

(3R.A n ~ ( 3(RnQ A ) ny(RnQ A ).An 

D = < 3R.B n D = l 3(R nQ B )n V(R r\Q B ).Bn 

{ MR.C { MR.C 

where subscripts on new role names help to identify which existential they 
simulate. Applying the tableau rules of Chapter 2 to D(x), one obtains the 
model 

R{x,y) A(y) 

Qa(x, y) C(y) 

R(x,z) B(z ) 

Q b (x,z) C(z) 

which satisfies both concepts. ■ 

Proposition 3.9 A concept D is satisfiable iff D is satisfiable. 

Proof The proof of the proposition follows the example. Namely, an open 
tableau branch for D is also an open tableau branch for D (ignoring as- 
sertions on new role names), and an open tableau branch for D can be 
transformed to an open tableau branch for D just by adding the assertions 
about new role names. □ 

As observed by Nebel [1990a], an acyclic role hierarchy in a Description 
Logic can be always simulated by conjunctions of existing roles and new 
role names. In the above example, using two role names Qa , Qb and the 
inclusions Qa E R, Qb E R yields the same simulation. 

Applying fl-simulation, one could obtain from the reduction in Theo- 
rem 3.6 the original reduction by Nebel, proving that satisfiability (and 
non-subsumption) in ACN{Vf) is NP-hard. Using a more complex reduction, 
Donini et al. [1997a] proved that satisfiability in A£A7(n) is in fact PSpace- 
complete. 
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3.3 AND-branching: finding a clash 

When candidate models of a concept have exponential size - as for the 
A££-concept of Subsection 3. 1.1. 2 - models cannot be guessed and checked 
in polynomial time. In this case, it is a combinatorial problem to find 
the clash - if any - in the candidate model. This leads to NP-hardness of 
unsatisfiability and subsumption. However, for many Description Logics the 
AND-tree structure of a model is such that its traces (branches of the AND- 
tree) have polynomial size. A concept C is satisfiable iff there is no trace 
containing a clash; hence it is sufficient to guess such a trace to show that 
C is unsatisfiable. From this argument, Schmidt-Schaufi and Srnolka [1991] 
proved that satisfiability in ACS is in coNP. 


3.3.1 Intractability of satisfiability in ACS 

We now report a proof that satisfiability in ACS is coNP-complete. The 
original proof was based on a polynomial-time reduction from a variant of 
the NP-complete problem one-in-three 3sat [Garey and Johnson, 1979, 
p. 259] . Here we present a proof based on the same idea, but with a slightly 
different construction, relying on a reduction from the NP-complete problem 
exact COVER (xc) [Garey and Johnson, 1979, p. 221], Such a problem is 
defined as follows. 

Definition 3.10 (Exact cover xc) Let U = {ui, . . . ,u n } be a finite set, 
and let At be a family M \, . . . , M m of subsets of U. Decide if there are q 
mutually disjoint subsets M tl , . . . , Mi q such that their union equals U , i.e. , 
M ih n M lk = 0 for 1 < h < k < q, and \J q k=1 M ik = U. 

The reduction consists in associating every instance of xc with an ACS- 
concept Cm, such that M. has an exact cover if and only if Cm is unsat- 
isfiable. It is important to note that, differently from the previous sections, 
here a solution of the NP-complete source problem is related to a proof of the 
absence of a model. In fact, exact covers of M. are related to those traces of 
{Cm{x)} that contain a clash; hence proving the existence of a solution of 
an NP-complete problem is related to a refutation in the target Description 
Logic. 

In the following we assume R to be a role name. We translate M into the 
concept 


C M = Cl n • • • n C[ n n Lfi 
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where each concept C{ represents a subset Mj , and is inductively defined as 


j f 3R.C^ 1 , if either l < n, ui £ Mj or l > n, ui- n £ Mj 

1 1 V7?.C’/ +| . if either Z < n, ui 0 Mj or l > n, u\_ n 0 Mj 

for Z £ {1, ... , 2n} 

and by the base case C*2 n+1 = T. The concept Di is defined by 


Di = Vi?. • • • Vi?.. , _L 

V v / 

2 n 

and each of D 2 ,D 3 , . . . has one universal quantifier less than the previous 
one. 

Intuitively, for every element ui in U there are two corresponding levels 
l, l + n in the concepts C\ ’s, where “level” refers to the nesting of quantifiers. 
The element ui is present in Mj if and only if there is an existential quantifier 
in the concept C( at level l + n - which implies by construction that 3 is 
also at level l. The concept D\ is designed in such a way that a clash for 
{Cvf(x)} can only occur in a trace containing at least 2n+ 1 individual 
names. 


Example 3.11 Consider the following instance of XC: let U = {iq, . . . , U3 }, 
and 

M = {Mi = {iti,it 2 },M 2 = {u 2 ,u 3 },M 3 = {u 3 }}- 

The corresponding „4££-concept Cj^i is given by the conjunction of 
C\ , Cf . C\ and D\ . defined as follows: 

U\ u 2 u 3 U\ u 2 u 3 
Mi ^ Cj = 3R.3R.\f R,.3R.3R.\f R.T 
M 2 — C'l = \/R,3R.3R.\/R.3R.3R. T 
M 3 ^ Cl = VR.VR.3R.VR.VR3R. T 
Di = yR.yR.VR.VR.yR.VR. ± 

where on the left we put the subset Mj corresponding to each C\ , and 
above we put the elements of U corresponding to each level of the concepts. 
Observe that the elements of U appear twice. ■ 

The conjunction of the above concepts is unsatisfiable if and only if the 
interplay of the various existential and universal quantifiers, represented by 
a trace, forces an individual name in the tableau for {C'x(x)} to belong to 
the extension of _L. This reduction creates a correspondence between such a 
trace and an exact cover of U . 
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In order to formally characterize such a correspondence, we define the 
activeness of a concept in a trace. Let T be a trace and C be a concept. We 
say that C is active in T if C is of the form 3R.D and there are individual 
names y. z such that T contains C(y), R(y,z), and D(z). Therefore, an 
existentially quantified concept 3R.D is active in T if the — ►g-rule has been 
applied to the assertion 3R.D(y) in T. Intuitively, if C J k is active in a trace 
of containing a clash, then Uk belongs to an exact cover of M. 

Lemma 3.12 ([Donini et al., 1992a], Lemma 3.1) Let.T be a trace of 
{C M {x)}. 

1. Suppose Cf. is active in T. Then for all l £ {1, . . . , k} if the concept Cf is of 
the form 3R.Cf +1 , then it is active in T. 

2. If T contains a clash, then for every l £ {1, . . . ,2 n} there exists exactly one j 
such that Cf is active in T. 

Example 3.13 The reader can gain an insight into the importance of the 
above properties by constructing the tableau for the concept 

(3R.VR.3R.A) n 
(3R.VR.3R.B) n 
(VR.3R. T) 

and verifying that the trace reaching the concept A has both existentials 
of the first line active (and neither existential of the second line), and vice 
versa for the trace reaching B. ■ 

Example 3.14 (Example 3.11 Continued) Note that in Example 3.11 
the two subsets M\ and M 2 form a (non-exact) cover of U, and indeed, 
the tableau for { C\ nC] n D\(x)} is satisfiable. Moreover, observe the im- 
portance of the two levels. If concepts were formed by just one level, the 
following concepts would be unsatisfiable (choose the highlighted existen- 
tials): 

C\ = 3R.3R.\/R.T 
Cl = V1?.3R.3R.T 
D[ = VR.VR.VR. _L 

corresponding to a cover by M \ and M 2 which is non-exact. The second 
level ensures that once an existential is chosen, all nested existentials must 
be chosen too to form a trace. ■ 
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Theorem 3.15 Unsatisfiability in ACE is NP-hard. 

Proof We show that an instance (U, M.) of xc has an exact cover if and 
only if Cm is unsatisfiable. Let M. = {Mi, . . . , M m } be a set of subsets of 
U and Cm = C\ n • • • n C™ n D\ be the corresponding concept. Since this 
proof is the base for three others in the chapter, we present it in some detail. 

Only if Let M tl , . . . , M lq be an exact cover of U. Let T be a trace of 
{Cm{x i)} defined inductively as follows: 

T\ = {C{(x i) | j G {1, . . .,m}} U {Di(xi)} 

Ti+i = Ti U {R(xi,xi +1 )} U {C J l+1 (x i+ i) | u i+ i <E Mj) U {Di +1 (x i+1 )}- 

Obviously, T = T 2 n+ i contains a clash, because D 2 n +i = -L. For each level 
l there is exactly one j such that C 3 = 3R.C 3 +l . Using this fact, one can 
easily show that T is a trace by induction on l. 

If If Cm is unsatisfiable, then there exists a trace T of {Cm{%)} such that 
T contains a clash. We show that the subsets in 

{Mj | 31 E {1, . . . , n} : C 3 n+l is active in T} 

form an exact cover of U. First of all, since T is a trace, for every level 
l £ {1, . . . , 2n} there exists a j such that C 3 is active in T (Lemma 3.12(h)). 
Hence the union of these subsets covers U. 

We now prove that no two subsets overlap: in fact, suppose there are i, j 
such that Mi, Mj intersect non-trivially in an element ui. Here we exploit 
the two-layered construction of Cm- By definition, there are h, k such that 
C' n+h and C 3 are active in T. Since ui is in both Mj and Mj, by construc- 
tion of Cm we have C\ = 3R.C\ +1 and Cj = 3R.C 3 l+1 . From Lemma 3.12(i), 
we know that Cl and C 3 are both active in T. Hence i = j from 
Lemma 3.12(h). □ 

The above reduction works also for the special case of xc in which every 
subset has at most three elements, which corresponds to at most six nested 
existential quantifications in each concept C\ . Hence, bounding the num- 
ber of nested existential quantifications by a constant k > 6 does not yield 
tractability. The original reduction from one-in-three 3sat shows more- 
over that bounding the number of existentials in each level by a constant 
k > 3 does not yield tractability. 

Simulating qualified existential quantifications in Cm by role intersec- 
tion (see Subsection 3. 2. 2.1), we conclude that unsatisfiability of concepts 
in AC{\f) - AC plus role conjunction - is NP-hard, too. 
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Theorem 3.16 Satisfiability and subsumption of concepts are NP-hard in 
AC( n). 

We note that this source of intractability is not due to the presence of the 
concept _L, but to the interplay of universal and existential quantification. In 
fact, the above reduction works also for the Description Logic FC~£, which 
is F C~ plus qualified existential quantification. 

Theorem 3.17 Subsumption is np -hard in FC~£ . 

Proof The proof is based on the reduction given for ACS. The „4££-concept 
C'M = C\ n • • • n C[ n n D\ in that reduction is unsatisfiable if and only if 
C\ n • • • n C™ is subsumed by -i D\. Now C\ n • • • n C\ n is a concept in 
FC~ £ and ~<Di can be rewritten to the equivalent concept E, defined as 

E = 3R.--3R. T 

' V ' 

2 n 

i.e. , a chain of 2 n qualified existential quantifications terminating with the 
concept T. Obviously, E is in FC~ S, hence subsumption in FC~S is np- 
hard. □ 

We now use the above construction to show that in three other Description 
Logics - extending F C~ with each pair of role constructs for role conjunc- 
tion, role inverse, and role chain - subsumption is NP-hard. The fact that 
reductions can be easily re-used is a characteristic of Description Logics. 
It depends on the compositional semantics of constructs - hardness proofs 
obviously carry over to more general Description Logics - but also on the 
extensional semantics, that allows one to simulate a construct with others. 


3.3.2 FC plus role conjunction and role inverse 

We abbreviate this Description Logic as FC~( n,“ ). We prove that 
FC~{ n, _ ) is hard for np by an argument similar to that for FC~£. 
One may be tempted to use Ll-simulation, defined in Subsection 3. 2. 2.1, 
which replaces qualified existential quantifications by role intersections. 
However, a direct Id-simulation of the concepts used in the reduction 
for FC~£ does not work. In fact, n-simulation preserves satisfiabil- 
ity, not subsumption; e.g., while 3R.C n D is subsumed by 3R.C, its 
Id-simulation 3(R n Qi) n MQ\.C n D is not subsumed by 3(i?nQ2)LI 
VQ 2 .C. 



122 


F. M. Donini 


To carry over the proof, it is useful to have a tableau rule for role inverse: 


Condition T contains R{x,y ), 

where R is either a role name P or its inverse P ~ ; 
Action T' = T U {R~(y,x)}, 

where if R = P~ , then R~ = P. 


Theorem 3.18 Subsumption in PC (n, ) is np -hard. 

Proof We refer to the concept Cm defined in the reduction given for ACS. 
Let n be the cardinality of U in xc. First define the concept F as follows: 

F = \/R. ■ ■ ■ \/R. \/(R~). ■ ■ • V(.R _ ). A 
" v v ' 

2n 2n 

where A is a concept name (recall that Cm does not contain any concept 
name but T and _L). F is a concept of PC~( n, _ ). 

Observe now that the A££-concept Cm = C\ FI • • • FI C J” n D\ is unsat- 
isfiable if and only if Cl n • • • n C™ n F is subsumed by A (where C is the 
□-simulation of C). In fact, the subsumption holds if and only if the com- 
plete tableau for {C\ n • • • n C™ n F(x), -o4(a;)} contains the only possible 
clash {A(x), -A(.t)}. This tableau contains a clash if and only if there is a 
trace of length 2 n in the tableau, and such a trace is in one-to-one corre- 
spondence with the exact covers of the problem xc. Hence subsumption in 
PC~(np ) is NP-hard. □ 


3.3.3 PC plus role conjunction and role chain 

We abbreviate this Description Logic as PC~{ n,o). 

Theorem 3.19 Subsumption inPC~{ n,o) is NP -hard. 

Proof Again, we refer to the concept Cm defined in the reduction given for 
ACS. Observe that the A££-concept Cm = C\ n . . . n C f 1 n D\ is unsat- 
isfiable if and only if Cj n . . . n C\ n is subsumed by (again, C is the 
n-simulation of C ). The claim holds, since Cl n . . . n C™ is in PC~{ n) and 
^D\ can be expressed as the equivalent concept G, defined as follows: 

G = 3 (R o • ■ ■ o R) . 

' V " 

2m. 


(3-8) 
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Obviously, G is in ^ 7 £“(o), hence subsumption in RL~{C\,o) is NP-hard. □ 

We note that in the above reduction, subsumption is proved intractable by 
using only role conjunction in the subsumee (to simulate existential quantifi- 
cation), and only role chain in the subsumer. We will exploit the subsumer 
(3.8) also in Subsection 3.3.4. 


3.3.4 3F C plus role chain and role inverse 

We abbreviate this Description Logic as We first show that, 

similarly to Subsubsection 3. 2. 2.1, qualified existential quantifications in a 
concept D can be replaced by a combination of role chains and role inverses, 
obtaining a new concept D that is satisfiable iff D is. 

3.3.4- 1 Simulating 3R.C via role chains and role inverses 

Donini et al. [1991b; 1999] showed that a concept D containing qualified 
existential role quantifications 3R.C is satisfiable iff the concept D is sat- 
isfiable, where in D each occurrence of a concept 3R.C is replaced by the 
concept 3(R o Qc ) n V(i2 o Q c o Q^).C, adding Qc as a new role name (a 
different Q for each occurrence of 3R.C, to be used nowhere else). We say 
that C is a o -simulation of C. 

This simulation too can be explained by referring to tableaux, through an 
example concept. 

Example 3.20 Consider the concept D below on the left, and its o- 
simulation D on the right: 

(3R.A n ^ ( 3(RoQ A )n\/{RoQ A oQ~).An 

D = l 3R.B n D = < 3(R o Q B ) n V(R o Q B o Q~).B n 

(VR.C 

where subscripts on new role names help to identify which existential they 
simulate. Applying the tableau rules of Chapter 2 to D(x), one obtains the 
model 

R{x,y) A(y) Q A (y,u y ) 

C(y) 

R(x, z) B(z) Qb{z,u z ) 

C(z) 

where subscripts on individuals u y , u z highlight that there is a new in- 
dividual name for each individual name used to satisfy an existential 
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quantification. That is, the number of individual names in the tableau for 
D is at most twice that in the tableau for D. ■ 

Lemma 3.21 Let D be an ACS-concept and D its o- simulation. Then D 
is satisfiable if and only if D is satisfiable. 

Proof The proof extends the above example. In one direction, an open 
tableau for D is also an open tableau for D (ignoring assertions on new 
role names). In the other direction, an open tableau for D can be trans- 
formed to an open tableau for D: to every role assertion R(x, y ) - added to 
satisfy an existential 3R.C in D - chain an assertion Qc(y,u y )- □ 

If C is an A££-concept, its o-simulation C is a concept belonging to 
the language AC{o~ ), that is, AC plus role inverses and role chains. Of 
course, o-simulations could be defined for concepts belonging to Description 
Logics more expressive than ACC. For Description Logics in which every 
concept is satisfiable (like T C~(o~ )) this simulation can be interesting only 
in subsumptions. 

We can now come back to subsumption in the Description Logic J-C~ 
plus role inverses and role chains. 

Theorem 3.22 Subsumption in PC~{o~ ) is nv-hard. 

Proof For every A££-concept C, one can compute in quadratic time an o- 
simulation C. For a given instance ( U,JA ) of xc, Cm is unsatisfiable iff (by 
Lemma 3.21) Cm is satisfiable iff C\ n • • • n Cf l is subsumed by —iD\. Now 
the subsumee contains no negated concept, hence it belongs to FC~{ o~ ). 
The subsumer is equivalent to the concept G in (3.8), which again is in 
JFC-(o-). □ 


3.4 Combining sources of complexity 

In a Description Logic containing both sources of complexity, one might 
expect to code any problem involving the exploration of polynomial-depth, 
rooted AND-OR graphs. The computational analog of such graphs is the 
class APTime (problems solved in polynomial time by an alternating Turing 
machine) which is equivalent to PSpace (e.g., see [Johnson, 1990, p. 98]). 
A well-known PS PACE-complete problem is Validity of Quantified Boolean 
Formulae: 
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Definition 3.23 (Quantified Boolean Formulae QBf) Decide the va- 
lidity of the (second-order logic) closed sentence 

(QiX 1 )(Q 2 X 2 ) • • • (Q„X n )[F(X!, ...,X n )\, 

where each Qi is a quantifier (either V or 3) and F(X i, . . . , X n ) is a Boolean 
formula with Boolean variables X \, . . . , X n . 

The problem remains PSPACE-complete if F is in 3CNF, i.e. , conjunctive 
normal form with at most three literals per clause. We call the string of 
quantifiers the prefix of the quantified formula, and the 3CNF formula F its 
matrix. 

This problem can be encoded in an AND-OR graph, using AND-nodes 
to encode V-quantifiers, and OR-nodes for 3-quantifiers. In the leaves, there 
is the matrix F. We use this analogy to illustrate the reduction, taken from 
[Schmidt-Schaufi and Snrolka, 1991]. 


3.4-1 PSPACE -hardness of satisfiability in ACC 

Without loss of generality, we assume that each clause is non-tautological, 
i.e., a literal and its complement do not appear both in the same clause. Let 
F = Gi A ■ ■ ■ A G m . The QBF {QxXfi) ■ ■ ■ (Q n X n )[Gi A • • • A G m ] is valid iff 
the ACC - concept 

C = D n C\ n • • • n Cf (3.9) 

is satisfiable, where in C all concepts are formed using the concept name A 
and the atomic role name R. The concept D encodes the prefix, and is of the 
form Di n Vi?.(Z ?2 13 Vi?.(. . . (D n _i n VR.D n ) . . .) where for i € {1, . . . , n} 
each Dj corresponds to a quantifier of the QBF in the following way: 

_ f (3 R.A) n (3i2.-u4), if Qi = V 
* “ \ 3i?.T, if Qi = 3. 

The concept C\ is obtained from the clause Gi using the concept name A 
when a Boolean variable occurs positively in Gi, —< A when it occurs nega- 
tively, and nesting l universal role quantifications to encode the variable A/. 
In detail, let k be the maximum index of all Boolean variables appearing in 
Gi . Then, for l e {1, . . . , (/c— 1)} one defines 

{ Vi?. (A U C l l+ , ) , if Xi appears positively in Gi 

Vi?.(-iA U C'i + j ) , if Xi appears negatively in Gi 
V/?X] +1 , if Xi does not appear in Gi 
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and the last concept of the sequence is defined as 


J MR. A, if Xk appears positively in G* 

\ \/R.-<A, if Xk appears negatively in Gj. 


It can be shown that each trace in a tableau branch for D corresponds 
to a truth assignment to the Boolean variables, and that all traces of a 
tableau branch correspond to a set of truth assignments consistent with the 
prefix. Therefore, Schmidt-Schaufi and Srnolka conclude that satisfiability 
in ACC is PSPACE-hard. Combining this result with the polynomial-space 
calculus given for ACCM in Chapter 2, one obtains that satisfiability (and 
subsumption) in ACCM are PS PACE-complete, and that the exponential- 
time behavior of the calculus cannot be improved unless PSpace =PTime. 
Satisfiability and subsumption are still in PSpace if role conjuctions are 
added to ACCM [Donini et al, 1997a], or if inverse roles and transitive roles 
are added to ACC [Horrocks et al, 2000b]. 

Using ri-simulations, one can use the same reduction to prove that both 
satisfiability and subsumption in ACU{r\) are PSPACE-hard (and thus 
PSPACE-complete). By a more complex reduction, Donini et al. [1991a] 
proved that satisfiability in A£A7(n) is also PSPACE-hard. Hemaspaan- 
dra [1999] proved that satisfiability in ACCM is PSPACE-hard using a re- 
duction from QBF, where the prefix was coded with a concept similar to D 
(more precisely, similar to the concept D in Subsubsection 3. 1.1. 2), and the 
matrix was coded in a more complex way. Also XC was proved PSPACE-hard 
in [Donini et al., 1997a]. Observe that all these Description Logics contain 
both sources of complexity. 


3.4-2 A remark on reductions 

Schild [l99l] observed that ACC is a notational variant of multi-modal logic 
K, whose satisfiability was proved PSPACE-hard by Ladner [1977], using 
a different reduction from qbf. This gives us the occasion to point out a 
characteristic of reductions from a different, fairly experimental viewpoint. 

The target modal formula in Ladner’s reduction has size quadratic w.r.t. 
the given instance of QBF, while one can observe that the concept G in (3.9) 
has only linear size. From a theoretical perspective of the PSpace reduction, 
this is irrelevant. However, QBF has also been studied from an experimental 
point of view (e.g., [Cadoli et al., 2000; Gent and Walsh, 1999]): trivial cases 
have been identified, easy-hard-easy patterns have been found, and one can 
use ratios of clauses / variables for which the probability that a random QBF 
is valid is around 0.5 - which have been proved experimentally to contain the 
“hard” instances. This experimental work can be transferred to Description 
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Logics, to compare the various algorithms and systems for reasoning in ACC. 
This transfer yields the benefits that: 

• concepts which are trivially (un)satifiable do not need to be isolated again; 

• the translation of “hard” QBFs can be used to test reasoning algorithms for ACC', 

• the performance of algorithms for ACC can be compared with best known algo- 
rithms for solving QBF (see [Cadoli et a, l, 2000; Rintanen, 1999; Giunchiglia et al., 
2001]), and optimizations can be carried over. 

However, using Ladner’s reduction to obtain “hard-to-reason” concepts, the 
quadratic blowup of the reduction soon makes the resulting concepts too 
big to be significantly tested. Using Schmidt-Schaufi and Snrolka linear re- 
duction, instead, one can use a spectrum of “hard” concepts as wide as the 
original instances of QBF. Thus, experimental analysis might make signifi- 
cant differences between (theoretically equivalent) polynomial many-to-one 
transformations used in reductions [Donini and Massacci, 2000]. 


3.5 Reasoning in the presence of axioms 

In this section we consider the impact of axioms on reasoning. Intuitively, 
axioms introduce new concept expressions in every individual generated in 
a tableau, so that simple arguments on termination and complexity based 
on the nesting of operators do not apply. We start with a comparison with 
Dynamic Logic, and then we show how axioms can encode a succinct repre- 
sentation of AND-OR graphs, leading to an ExpTime lower bound. 


3.5.1 Results from Propositional Dynamic Logic 

Propositional Dynamic Logic (pdl) [Harel et al., 2000] is a formalism able 
to express propositional properties of programs. Instead of introducing yet 
another logical syntax, we will talk about PDL in terms of Description Logics. 
A precise correspondence between Description Logics and pdl can be found 
in Chapter 5. 

The counterpart of pdl in Description Logics is ACCtrans [Baader, 1991], 
already defined in Chapter 2. We recall that ACCtrans is ACC plus a rich set 
of role constructors: union of roles, composition, and transitive closure. To 
be precise, pdl has also a role-forming constructor which is role identity, and 
the closure of a role is the reflexive-transitive one, denoted as R* . Reflexive- 
transitive closure is defined similarly to transitive closure, but considering 
also every pair (a, a) to be in the interpretation of R* . However, Schild [1991] 
showed that these are minor differences, as long as we are concerned with 
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computational behavior only. 

pdl and ACCtrans are relevant in this section about axioms, because us- 
ing union and transitive closure of roles, one can “internalize” axioms in a 
concept in the following way [Baader, 1991; Schild, 1991]. Let C be an ACC 
concept, T a set of axioms of the form Cj C i e {1, . . . , rn} . Observe 
that every axiom can also be thought of as a concept —> C U D which every 
individual in a model must belong to. Let Ri, , R n be all the role names 
used in either C or T. Then C is satisfiable w.r.t. T iff the following concept 
is satisfiable: 

Cn V(Ri u • • • u R n )* u Di) n • • • n (-. C m u D, m )). (3.10) 

The key property that makes this reduction correct is the connected model 
property [Streett, 1982]: if C has a model w.r.t. a set of axioms, then it 
has also a model in which one element a £ A x is in C x , and for every other 
element b in the model, there is a path of roles from a to b. 

Concept (3.10) is just a syntactic variant of a pdl expression. Hence, every 
upper bound on complexity of satisfiability for pdl applies also to concept 
satisfiability in ACC w.r.t. axioms, including all role constructors of pdl. 
Namely, satisfiability in pdl was proved to be decidable in deterministic 
exponential time, first by Pratt [1979], and then by Vardi and Wolper [1986] 
using an embedding into tree automata. This upper bound holds also for 
ACC plus axioms. It is interesting to observe that the deterministic ex- 
ponential time upper bound was nontrivial; simple nondeterministic upper 
bounds were proved by Fischer and Ladner [1979] for PDL and by Buchheit 
et al. [1993a] for Description Logics, using tableaux. Only recently a tableau 
with lemmas providing a deterministic exponential upper bound has been 
found [Donini and Massacci, 2000]. 

Regarding hardness, every lower bound on reasoning in ACC with axioms 
carries over to pdl. However, lower bounds for PDL were already known. 
Fischer and Ladner [1979] proved that pdl is ExpTiME-hard using a reduc- 
tion from Alternating Turing Machines working in polynomial space (recall 
that the complexity class Alternating Polynomial Space is the same as Ex- 
pTime [Johnson, 1990]). Van Ernde Boas [1997] proved the same result 
using a reduction from alternating domino games. However, both hardness 
proofs use a very small part of PDL, and in particular, transitive closure on 
roles appears only in one expression of the form (3.10), so that proofs could 
be adapted to ACC concept satisfiability w.r.t. a set of inclusions, in a very 
simple way. Moreover, the proofs use MR.C to code an AND-node, and 3R.C 
to code an OR-node. Hence, they follow the same intuition presented in the 
previous section, where we showed the correspondence between AND-OR 
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trees and satisfiability of ACC without axioms. 

Here, we want to present yet another proof, of a very different nature, 
that highlights the fact that concept inclusions can express a large structure 
in a succinct way. 


3.5.2 Axioms and succinct representations of AND-OR graphs 

We now need more precise definitions about AND-OR graphs. An AND-OR 
graph is a graph in which nodes are partitioned into AND-nodes and OR- 
nodes. An OR-node is reachable if one of its predecessors is reachable (as in 
ordinary graphs), while an AND-node is reachable only if all its predecessors 
are reachable. 

Definition 3.24 (AND OR graph Accessibility Problem (agap)) 
Given an AND-OR graph, a set of source nodes Si, ... , S m , and a target 
node T, is T reachable from Si , ... , S' m ? 

Let n be the number of nodes of the graph, and d (a constant) the max- 
imum number of predecessors of a node. It is well known that AGAP can 
be solved in time polynomial in n (e.g., it can be reduced to Monotone 
Circuit Value, which is PTlME-complete [Papadimitriou, 1994]). However, 
AGAP becomes ExpTlME-complete when one considers its succinct version 
[Balcazar, 1996]. Let the out-degree of a node be bounded by a constant d. 
Let C be a Boolean circuit with logn inputs, and with 1 + dlogn outputs; 
when the input of C is the binary encoding of a node N, its outputs are 
the encodings of the type of N (AND/OR) and of the d predecessors of N 
(using a dummy node if there are fewer than d predecessors). 

Definition 3.25 (Succinct AND-OR Graph Accessibility Prob- 
lem (s(agap))) Given a circuit C representing an AND-OR graph, a 
set of source nodes Si,... ,S m , and a target node T, is T reachable from 
Si,...,S m ? 

Now, s(agap) is ExpTlME-complete [Balcazar, 1996]. The intuition for this 
exponential blowup in complexity is that there are many circuits which can 
encode graphs whose size is exponentially larger than the circuit size. This 
intuition applies to many other succinct representations of problems with 
circuits [Papadimitriou, 1994, p. 492] or with propositional formulae [Veith, 
1997], yielding complete problems for high complexity classes. 
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We reduce s(agap) for graphs with in-degree d = 2 to unsatisfiability 
of an ACC concept C w.r.t. a set of inclusions T . Intuitively, the axioms 
can succinctly encode either a proof of unsatisfiability for a concept, or 
a model for C w.r.t. T . We note that, since we are coding reachability 
into unsatisfiability, we will use n to code OR-nodes - a conjunction is 
unsatisfiable when at least one of its conjuncts is - and LI to code AND- 
nodes. 

First of all, let Ai, . . . , A] og n be a set of concept names one-to-one with 
the inputs of the circuit C. Each node A in the graph is then mapped into 
a conjunction of As and their negations, denoted by concept(N ) , depending 
on the code of A: if the ith bit in the code of A is 1, use A,;, if it is 0, use — Aj. 
For example, if A has code 1101 then concept(N) is A\ n A -2 n - 1 A 3 n A 4 . 

Then, let B\, . . . , B 1 ogn and B 2 , . . . , B 2 n be two sets of concept names 
one-to-one with the outputs of C. Conjunctions of B s with negations code 
predecessor nodes. 

Moreover, let two concept names AND, OR represent the type of a 
graph node. If C has k internal gates, we use also k concept names 
W\, . . . , Wk- For each gate, we use a concept equality that mimics the 
Boolean formula defining the gate. E.g., if C has an A-gate x\ A xi = X 3 , 
we use the equality X\ n X2 = A3, where Ai, X2, A3 can either be con- 
cept names among W\ , ... ,W p. denoting input/output of internal gates, 
or be some of the As and B s, denoting inputs/outputs of the whole cir- 
cuit. 

For the output of C encoding the type of the node, we use directly the two 
concept names AND , OR in the concept equality coding the output gate of 
C. Moreover, to model the different interpretation of predecessors for the 
two type of nodes, we use the inclusions 

AND C 3R 1 .TU3R 2 .T (3.11) 

OR C 3R 1 .TH3R 2 .T (3.12) 

where R 1 and R 2 are two role names (we use indices 1,2 to parallel indices 
of the B s). Observe that concept AND implies a disjunction U, and con- 
cept OR implies a conjunction n. This is because we reduce reachability 
to unsatisfiability, as we said before. Moreover, observe that predecessors in 
the AND-OR graph are coded into role successors in the target Description 
Logic. 

For the output of C encoding the predecessors of a node, we add the 
following inclusions for i 6 { 1 , . . . , log n}: 

B] C MR 1 . Ai 


(3.13) 
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- B } 

C 

Vi? 1 .-! A* 

(3.14) 

Bf 

C 

MR 2 .Ai 

(3.15) 

- B 1 

c 

Vi? 2 .-iAj. 

(3.16) 


We denote by 7c the set of all of the above axioms. 

We now give an example of what the axioms imply. Suppose C computes 
the two predecessors 1011 and 0110 for node 1101. Then, equalities coding C 
force conceptf 1101) = A\ n Ai n -'A 3 n A 4 to be included in B\, ~'B\ , B\ , 
B\ (first predecessor) and -1 Bf, B 2 , B 2 , B 2 (second predecessor). Then 
inclusions (3.13)-(3.16) say that every i? 1 -successor is included in A\, -> A 2 , 
A 3 , A 4 - which conjoined, make concept^ 1011 ) - and that every i? 2 -successor 
is included in -u4i, A 2 , A 3 , - 1 A 4 (concept (0110)) . Moreover, if C computes an 
AND-type for node 1101, then axiom (3.11) implies that the corresponding 
concept is included in AND, and this implies that either an ^-successor or 
an i? 2 -successor exists. For OR- type nodes, both successors exist. 

Theorem 3.26 Let C be a circuit, T be the target node, and Si,...,S m 
be the source nodes in an instance of s(agap). Then T is reachable from 
Si, , S m iff concept(T) is unsatisfiable in the TBox 7c U {concept(S\) C 
_L} U ■ ■ ■ U {concept(S m ) C _L}. 

Proof Most of the rationale of the proof has been informally given above. 
We sketch what is needed to complete the proof. 

If Suppose T is unreachable from S±, . . . , S m . We construct a model 
(X, A x ) for concept(T) satisfying the axioms as follows. Let A x be the set of 
all nodes in the graph which are unreachable from Si, ... , S m . Then, (R 1 ) 1 
is the set of pairs (a, b) of nodes in A 1 , such that b is the first predecessor of 
a, and similarly for (R 2 ) 1 (second predecessor). For i £ {1, . . . , logn}, (Ai ) 1 
is the set of nodes in A x whose binary code has the ith bit equal to 1. The 
interpretation of the B s, IFs, and AND, OR concepts is according to the 
1 -value of the circuit: node a is in their interpretation iff the output they 
correspond to is 1 when the code of a is the input of the circuit. 

Then, T £ (concept(T)) x , and moreover (X,A x ) satisfies by construction 
all axioms in 7c; e.g., if an OR-node is unreachable, then both its predeces- 
sors are unreachable, hence both predecessors are in A 1 , and axiom (3.12) 
is satisfied. Similarly for an AND-node. 

Only if Let N be any node reachable from Sj , S m , and let d(N) be 
the depth of the shortest hyperpath leading from 5j , . . ; S m to N. We show 
by induction on d(N) that concept(N) is unsatisfiable in the TBox. 
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If d{N ) = 0, the claim holds by construction. Let N be a reachable node, 
with d(N) = k + 1. If IV is an OR-node, at least one of its predecessors - let 
it be the first predecessor, and call it M - is reachable with d(M ) = k. Then 
concept (M) is unsatisfiable by inductive hypothesis. But axiom (3.12) im- 
plies that concept(N) is included in BRVT n 3i? 2 .T, while (3.13)-(3.16) im- 
ply that concept(N) is included in V R 1 .concept(M) , that is, Vitd.-L. Hence, 
also concept(N) is unsatisfiable. A similar proof holds in case N is an AND- 
node. 

Then, the claim holds for N = T. □ 

Observe that in the above proof we did not use qualified existential quan- 
tification; hence, the proof works for the sublanguage of ACC called ACU. 
Now, axioms coding the circuit can be propositionally rewritten without 
union. Moreover, the only other axiom in which union is needed is (3.11), 
which could be rewritten equivalently as Vi? 1 . _L n Vi? 2 .T C — i OR, which is 
now in the language AC. 

Theorem 3.27 Let C be a concept and T a set of inclusions in AC, with 
at least two role names. Deciding whether C is unsatisfiable w.r.t. T is 
ExpTime -hard. 

The above theorem sharpens a result by Calvanese [1996b], who proved 
ExpTiME-hardness for ACU. McAllester et al. [1996] proved ExpTime- 
hardness for a logic that includes T C~ C, and their proof can be rewritten 
to work with ACU. 

Remark 3.28 The above proof does not follow the correspondence used 
by Fischer and Ladner [1979] between AND-nodes and \/R.C concepts on 
one side, and OR-nodes and 3R.C concepts on the other side. There, quan- 
tifications 3R and \/R.C were used to code predecessors in the graph, node 
type was coded by n, U constructors, while axioms were crucial to mimic 
the behavior of the circuit. ■ 


3.5.3 Syntax restrictions on axioms 

In the proof, no restriction on axioms was imposed. A significant syntactic 
restriction is to allow only concept names on the left-hand side of axioms. 
In this case, a dependency graph induced by the axioms of a TBox T can 
be constructed, whose nodes are labeled by concept names. A node A is 
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connected to a node B if the concept name B appears (also as a subconcept) 
in a concept C. and A C C is an axiom. Then, it makes sense to distinguish 
between cyclic axioms, in which the dependency graph contains a cycle, and 
acyclic axioms. 

Acyclicity is significant, because if only acyclic axioms are allowed, then 
reasoning in ACC can be performed in PSpace by expanding axioms when 
needed [Baader and Hollunder, 1991b; Calvanese, 1996b]. The only case for 
ACC (till now) in which acyclic axioms make reasoning ExpTlME-hard is 
when concrete domains are also added [Lutz, 2001b]. 

Also sublanguages of ACC can be considered. With regard to acyclic ax- 
ioms in AC, Buchheit et al. [1998] proved that subsumption in acyclic AC 
TBoxes is coNP-hard, and in PSpace. Calvanese [1996b] proved that cyclic 
axioms in AC are PSPACE-complete, and other results for ACS and ACU. 

A second possible restriction is to allow axioms of the form A = C, but in 
which a concept name can appear only once on the left-hand side. For axioms 
of this form in ACM , Kiisters [1998] proved that reasoning is PSpace- 
complete when the TBox is cyclic, and NP-complete when it is acyclic. 


3.6 Undecidability 

One of the main reasons why satisfiability and subsumption in many De- 
scription Logics are decidable - although highly complex - is that most 
of the concept constructors can express only local properties about an ele- 
ment [Vardi, 1997; Libkin, 2000]. Let C be a concept in ACC\ recalling the 
tableau methods in Chapter 2, an assertion C(x) states properties about x , 
and about elements which are linked to x by a chain of at most \C\ role 
assertions. Intuitively, this implies that a constraint regarding x will not 
“talk about” elements which are arbitrarily far (w.r.t. role links) from x. 
This also means that in ACC, and in many Description Logics, an assertion 
on an individual cannot state properties about a whole structure satisfying 
it. However, not every Description Logic satisfies locality. 


3. 6. 1 Undecidability of role-value-maps 

The first notable non-local Description Logic is a subset of the language 
of the knowledge representation system Kl-One, isolated by Schmidt- 
Schaufi [1989], which we call 2 J-C~( o, =). It contains conjunction, univer- 
sal quantification, role composition, and equality role-value-maps R = Q. A 


In his paper, Schmidt- SchauB used the name AC1Z. 


2 
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Table 3.2. Syntax and semantics of the Description Logic TC (o, C). 



concept expressions 

semantics 

concept name 

A 

C A 1 

value restriction 

MR.C 

{x e A 1 Vy. ( x , y) € R 1 —■ > y € C 1 } 

concept intersection 

CUD 

C 1 AD 1 

role- value-map 

RCQ 

{x& A 1 \ My. (x, y) e R 1 -> ( x , y) G Q 1 } 


role expressions 

semantics 

role name 

P 

C A 1 x A x 

role composition 

RoQ 

{{x,y) G A 1 x A 1 | 3 z. (x,z) G R 1 , 
(z, y) G Q 1 } 


role- value-map allows one to express concepts like “persons whose co-workers 
coincide with their relatives”, as it could be, e.g., a small family-based firm. 
Using two role names co-worker and relative, this concept would be expressed 
as (co-worker = relative n person). 

The Description Logic proved undecidable by Schmidt-Schaufi used equal- 
ity role- value-maps. Here we present a simpler proof for a Description Logic 
using containment role-value-maps R C Q. We call this Description Logic 
tFjC,~(o, U). Clearly, ^ 7 £ _ (o, C) is (slightly) more expressive than iFC~{ o, =), 
since R = Q can be expressed by (R C Q) n (Q C R), but not vice versa. 
Most of the original reduction is preserved, though. 

Although all constructs of 3FC~{ o, C) have already been defined in differ- 
ent parts of Chapter 2, we recall for convenience their syntax and semantics 
in the single Table 3.2. Recall that R C Q is a concept: namely, the concept 
of all elements whose set of fillers for role R is included in the set of fillers 
for role Q. To avoid many parentheses, we assume o has always precedence 
over C. 

Before giving the proof that subsumption in iFC~ (o,C.) is undecidable, 
let us consider an example illustrating why 3FC~{ o, C) is not local. 

Example 3.29 Let Q, R, S, U, V be role names. Consider whether the con- 
cept C = VS.VU.A n (R o Q C S) n \/R.(Q o U C V) is subsumed by the con- 
cept D = VR.VQ.VU.B. 

The answer is no: in fact, a model satisfying C and not satisfying D 
is shown in Fig. 3.2. This model can be obtained by trying to satisfy 
-i D = 3R.3Q.3U.^B with individual x,y,z,w, and then adding role asser- 
tions satisfying C. Observe that a model of C cannot be a tree because 
of concepts like (Ro Q C S). Hence, any notion of “distance” between two 
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Fig. 3.2. A possible countermodel for CCD in Example 3.29. Boxes group asser- 
tions about an individual; arrows represent role assertions. 

individuals in a model, as the number of role links connecting them, is am- 
biguous when a Description Logic has role- value- maps. Moreover, the sat- 
isfaction of the assertions (R o Q C S)(x) and VS.A(x) in an interpretation 
depends on the satisfaction of the assertion A(z), for every individual z con- 
nected to x via a path of role fillers that can be composed according to 
role- value-maps. In fact, replacing B with A in D yields a concept D' which 
now subsumes C - and indeed, the previous model satisfies also D' . ■ 

These properties are crucial for the reduction from ground rewriting sys- 
tems to subsumption in TC~{ o,C). For basics about rewriting systems, 
consult [Dershowitz and Jouannaud, 1990]. 

Definition 3.30 (Ground rewriting system) Let S be a finite alphabet 
{a,b , . . .}. A term w on S is an element of £*, i.e. , a finite sequence of 0 
or more letters from S. If v,w are terms, their concatenation is a term, 
denoted by vw. A ground rewriting system is a finite set of rewriting rules 

P = {si L}i=i,...,n> where for every i € {1, , n} both s* and L; are terms 

on S. The rewriting relation A induced by a set of rewriting rules p is 
the minimal relation which is reflexive, transitive, and satisfies the following 
conditions: 

1. if s — > t € p then s —> t; 

2. for every letter a £ S, if p A q then both ap —* aq and pa A qa. 

The rewriting problem for ground rewriting systems is: Given a set of rewrit- 
ing rules p and two terms v , vj , decide whether v — > w. 

Remark 3.31 In general, a single rewriting step of a term v consists in 
finding a substring of v which coincides with the antecedent s of a rewriting 
rule s — > t, and then substituting t for s in v. Hence, v —> w if there exist n 
terms u\, . . . , u n such that u\ = v, u n = w, and for each i G {1, . . . , n — 1} 
the two terms iq, Ui + \ are such that for some terms p and q. we have u* = psq, 
Ui - |_i = ptq. and s — ► t € p. This proves that the term problem is recursively 
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enumerable. However, it is senridecidable (recursively enumerable, but non- 
recursive) . ■ 

We reduce this problem to subsumption in XX _ (o,C) as follows. First 
of all, observe that we can define the following one-to-one correspondence 
between terms and role chains: 

• For every letter a in S, let P a be a role name. 

• For every term w, let R w be the composition of the role names corresponding to 
the letters of w. For example, if w = aab , then R w = P a o P a o P b . 

Now for each set of rewriting rules p, we define the concept C p as 

C p = I C Ri). 

Let Q be a new atomic role: we define a concept Cy as 

Cy = riaesCQ 0 P a ^Q)- 

Intuitively, if a model X satisfies C'y(x), then for every term w, if (Q o 
R w )(x, z ) holds in X, then Q(x , z) also holds, i.e. , x is directly connected via 
Q to every other element 2 : to which it is indirectly connected via Q o R w . 

If also X |= VQ.C p (x), then C p (z ) holds for every such z. This is a key 
property of the reduction. 

Remark 3.32 The two concepts MQ.C P and Cy are a way to internalize 
simple axioms in a concept. Consider a TBox X = {T C C p } which states 
that every individual in a model must satisfy concept C p . One could prove 
that in XX - (o, C) a concept C is subsumed by a concept D w.r.t. X iff Cy n 
\/Q.C p n MQ.C is subsumed by \/Q.D, where the latter is plain subsumption 
between concept expressions. ■ 

Theorem 3.33 Subsumption in XX~( o,C) is undecidable. 

Let p be a set of rewriting rules, and v, w be two terms. Define the following 
two concepts: 

C = Cy n VQ.Cp (3.17) 

D = VQ.(R v C R w ). (3.18) 

We divide the proof in two lemmas. 

Lemma 3.34 If v w then the concept C is subsumed by D. 
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Proof We first prove that the claim holds for the base case of the inductive 
definition of A (Condition (i) in Definition 3.30). Then, we prove the claim 
for the two inductive cases (Condition (ii)). Finally, we prove that the proof 
carries over the closure conditions. In all cases, let s — > t € p. 

Base case The concept D is \/Q.(R s C R t ). Observe that the concept 
MQ.Cp is equivalent to I \ s ^t£pVQ-{Rs ^ Rt)- Hence, C is subsumed by D 
because D is one of the conjuncts of (an equivalent form of) C. 

Inductive cases For the first inductive case, let D = VQ.(P a o R p C P a o 
R q ), and let the inductive hypothesis be that C is subsumed by \/Q.R p C R q . 
For a contradiction, suppose C is not subsumed by D: then, there is a model 
X in which both C{x) and ~^D(x) hold. The latter constraint implies that 
there is an element y such that: 

1. X |= Q(x,y) 

2- X |= (P 0 o R p )(y, z) 

3. l\£(P a oR q )(y,z). 

From 2, there is an element y' such that both P a (y,y') and R p (y',z ) 
hold. Now from C'e(x'), we must have X |= Q(x,y r ), and from the induc- 
tive hypothesis this implies (R p C R q )(y'). Then, X |= R q (y',z ) holds, hence 
X |= (P a o R q )(y, z), contradicting 3. 

The second inductive case is simpler, since one does not need to consider 
C%(x). The interested reader can use it as an exercise. 

We conclude the proof by showing that the reduction carries over the 
reflexive and transitive closure of A. 

First, from the semantics in Table 3.2 it follows that R w C R w is equivalent 
to T, which implies also that D = T. Hence the claim holds also for w w 
(i.e. , reflexivity) . 

For transitivity, the induction is easy: suppose u — > v and v —> w: then 
by induction C is subsumed by D\ and by X> 2 , where D\ = VQ.(R U C R v ) 
and Z ?2 = VQ.(R v C R w ) . Then C is subsumed also by D\ n D -2 which is 
equivalent to VQ.((R U C R v ) n (R, v C R w )). This concept is subsumed by 
\/Q.(R u C R w ), which is the claim. □ 

We now prove the other direction of the reduction. 

Lemma 3.35 If v yC w, then the concept C is not subsumed by D. 

Proof We give the rule for constructing an infinite tableau branch X and 
show that it defines a model that satisfies C, and does not satisfy D. The 
tableau is one-to-one with an infinite automaton accepting the term v, and 
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every other term into which v can be rewritten. Let v [1], . . . , v[n] denote the 
n letters of v (v [i] is the ?'th letter of v). 

Let x, y , 2 i be individual names. Start from the set of assertions 

% Pv [ 1 ] (?/> J/l) > ■ • ‘ > -^[i+l] (yi i Ui + 1 )>■••) Pv [n] (ijn—l ? 2 ) • 

Then add role assertions to T following the — >c-mle: 

Condition there is a rewriting rule s — > t £ p 

where s = s[l] • • • s[/i] and t. = t[l] • • • f[fc]; 

T contains h + 1 individuals yo,- .. ,yn and h assertions 
P s [i\{yi-I,yi) for i <E {1, . . .,h} 

T does not contain all assertions P t [i](yo, 2/i), • • • , Pt[k\{y'k-n yh) 
Action T' = Tll {P t [i]{yo, y'i), ■ ■ ■ , Pt[k\(y'k-i> yh)}: 

where y[, , y' k _i, are k — 1 individual names not occurring in T. 

Intuitively, if there is in T a path of role assertions such that R s (yo, yh) holds, 
the — >c-rule adds another path such that also Rt(yo,yh ) holds. Of course, 
%j can have an infinite number of individuals and role assertions between 
them; this is reasonable, since its role paths from y to z are one-to-one 
with the possible transformations on v one can make using the rewriting 
rules. One can also think of as an infinite-state automaton accepting 
v = {u | v u}. 

The — >c-nrle always adds new assertions to T, and its application given 
some premises does not destroy other premises of application of the — >c-rule 
itself, since we keep in T all the rewritten terms. Therefore, the construction 
is monotonic over the C-lattice of all tableaux with a countable number 
of individuals and role assertions between individuals. Hence there exists 
a fixpoint T.^. In building T v, however, a fair strategy must be adopted. 
That is, if at a given stage % of the construction, the — >c-rule is applicable 
for individuals yo, • • • , yh, then for some finite k, in % + k the — >c-rule has 
been applied for those premises - i.e., a possible rule application is not 
indefinitely deferred. This could be achieved by, e.g., inserting possible rule 
applications in a queue. 

Proposition 3.36 Let be constructed using the — >c -rule, and a fair 
strategy. For every term u = w[l] • • • u[k\, v ^ u iff in there are k — 1 in- 
dividual names yi , . . . , yk-i and k assertions P u [\](y , yi), . . . , P u \u{yk- 1 , z). 

Proof If v u, then there are a minimum finite number n of applications 
of rewriting rules in p transforming v into u. By induction on such n, the 
premises of the — >c~rule are fulfilled, and since 7^ is built adopting a fair 
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strategy, from some finite stage of its construction onwards, R u (y,z ) must 
hold. For the other direction, if R u (y, z) holds in then for each — »c-rule 
application leading to R u (y,z ) one can apply a rewriting rule to v, leading 
to u. □ 

We can now define the model Z satisfying C and not satisfying D. Let 
N be the set of individual names of T w . Z has domain {x} U N . Let Z = 
U {Q(x, y)\y 6 N}. Then Z satisfies C(x) straightforwardly; moreover, it 
does not satisfy D from Proposition 3.36. □ 

To prove that subsumption is undecidable in the less expressive Descrip- 
tion Logic JF£-(o,=), Schmidt-Schaufi [1989] started from the word prob- 
lem for groups. Starting from the Post correspondence problem, with a more 
complex construction, Patel-Schneider [1989b] proved that subsumption is 
also undecidable in the more expressive Description Logic PL~( o, C) plus 
role inverses, functional roles, and role restrictions. 

Starting from the word problem - which is less general than the term 
rewriting problem, but still semidecidable - Baader [1998] showed that sub- 
sumption in PC~(o, C) is undecidable without referring to tableaux. We 
report here the second part of his proof (corresponding to Lemma 3.35), 
since it is quite short and elegant, and shows a different way of proving the 
only-if direction, namely, giving a direct definition of an infinite structure 
satisfying the concepts. 

The word problem follows Definition 3.30, but considers the reflexive- 
symmetric-transitive closure A of rewriting rules. This is also known as 
the word problem for semigroups, or Thue systems. In this case, ground 
term and word are synonyms. Of course, is an equivalence relation on 
words; let [u] denote the ^-equivalence classes. Note that [«] = [u] iff u v. 
There is a natural multiplication on these classes induced by concatenation: 
[it][u] = [uu] (since A is even a congruence, this is well-defined). 

Taking the equivalence classes plus one distinguished element x as the 
domain of the model X, the roles can be interpreted as 

Q J = {(x,[u])|«eS*} (3.19) 

(Pa) 1 = {([«], [ua] ) | a S*}. (3.20) 

Then, it can be shown that if v w, then x belongs to C x but not to D x , 
as follows: 

1. x belongs to C x \ from (3.20), for every word u we have ( x , [u]) £ Q x 
and ([a], [mo]) £ (P a ) X \ but also from (3.19), (a:, [ua]) £ Q x , hence Cs(x) is 
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satisfied by X. Regarding VQ.C p (x), suppose ([u], [iy]) G (4? s ) x , where s — > t G p. 
Then [iy] = [us] by definition of (Pa ) 1 ■ Moreover, from s-»fGp it follows 
that us 44 ut, hence [us] = [ut]. Consequently, (M,M) = (M,M) G (R t ) x 
from (3.20). 

2. x does not belong to D x : for the empty word e, [e] is a Q-filler of x; however, [e] 
does not satisfy the concept R v C R w . In fact, ([e], [u]) G (R V ) X ', but ([e], [u]) ^ 
(R w ) x , since [ru] is the only R^-filler of [e] but [y] [to] from the assumption 
that v yG w. 


3.7 Reasoning about individuals in ABoxes 

When an ABox is considered, the reasoning problem of instance checking 
arises: Given an ABox A, an individual a and a concept C, decide whether 
A |= C(a ). For the instance check problem, the size of the input is formed 
by the size of the concept expression C plus the size of A. Since the size of 
one input may be much larger than the other in real applications, it makes 
sense to distinguish the complexity w.r.t. the two inputs - as is usually done 
in databases with data complexity and query complexity [Vardi, 1982], 

A common intuition [Schmolze and Lipkis, 1983] about instance checking 
was that it could be performed via subsumption, using the so-called most 
specific concept (msc) method. 

Definition 3.37 (Most specific concepts) Let A be an ABox in a given 
Description Logic, and let a be an individual in A. A concept C is the most 
specific concept of a in A, written msc(A , a), if, for every concept D in the 
given Description Logic, A \= D(a ) implies CCD. 

Recall from Chapter 2 a slightly different definition of msc in the realiza- 
tion problem: given an individual a and an ABox A, find the most specific 
concepts C (w.r.t. subsumption) such that A \= C(a ) [Nebel, 1990a, p. 104]. 
Since conjunction is always available in every Description Logic, the two def- 
initions are equivalent (just conjoin all specific concepts of realization into 
one msc). 

Clearly, once msc(A, a) is known, to decide whether a is an instance of a 
concept D it should be sufficient to check whether msc(A,a ) is subsumed 
by D, turning instance checking into subsumption. Moreover, when a TBox 
is present, off-line classification of all rnsc’s in the TBox may provide a way 
to pre-compute many instance checks, providing an on-line speed-up. 

The intuition about how to compute msc(A , a) was to gather the con- 
cepts/properties explicitly stated for a in A. However, this approach is 
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quite sensitive to the Description Logic chosen to express msc(A, a) and the 
queries. In fact, most specific concepts can be easily computed for simple 
Description Logics, like AC. However, it may not be possible when slightly 
more expressive languages are considered. 


Example 3.38 A simple example (simplified from [Baader and Kiisters, 
1998]) is the ABox made just by the assertion R(a , a). If T C~ is used for most 
specific concepts and queries, then msc({R(a , a)}, a) = 3R. However, if qual- 
ified existential quantification is allowed for most specific concepts, then each 
of the concepts 3R, 3R.3R, 3R.3R.3R, . . . , is more specific than the previ- 
ous one. Using this argument, it is possible to prove that msc({R(a, a)}, a) 
has no finite representation, unless transitive closure on roles is also allowed. 
Using the axiom A C 3R.A in an ad hoc TBox, msc({R(a, a)}, a) = A for 
the simple ABox of this example - but this does not simplify instance check- 
ing. An alternative approach would be to raise individuals in the language 
to express concepts, through the concept constructor {. . .} that enumerates 
the individuals belonging to it (called “one-of” in Classic). In that case, 
msc({R(a, a)}, a) = 3i?.{a} (see [Donini et al. , 1990]). But this “solution” 
to instance checking becomes now a problem for subsumption, which must 
take individuals into account (for a treatment of Description Logics with 
one-of, see [Schaerf, 1994a]). ■ 


The msc method makes an implicit assumption: to work well, the size 
of msc(A,a ) should be comparable with the size of the whole ABox, 
and in most cases much shorter. However, consider the Description Logic 
ACS, in which subsumption is in np. Then, solving instance checking 
by means of subsumption in polynomial space and time would imply 
that instance checking was in NP, too. However, suppose that we prove 
that instance checking is hard for conp. Then, solving instance check- 
ing by subsumption implies that either conp C np, or msc(A,a), if ever 
exists, has superpolynomial size w.r.t. A. The former conclusion is un- 
likely to hold, while the latter would make unfeasible the entire method 
of rnsc’s. 

In general, this argument works whenever subsumption in a Description 
Logic belongs to a complexity class C, while instance checking is proved hard 
for a different complexity class C , for which C' C C is believed to be false. 
We present here a proof using this argument, found by Schaerf [1993; 1994b; 
1994a]. 

We first start with a simple example highlighting the construction. 
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Example 3.39 Let f,c\,C2,x,y,z be individuals, R,P,N be role names, 
and A be a concept name. Let A be the following ABox, whose structure 
we highlight by means of arrows between assertions: 



R(f,C2) 


/ P(ci,x) 
\ N(ci,y) 
S P(c 2 ,y) 
\ N(c 2 ,z) 


A(x ) 


The query 3R.(3P.A n 3N.—>A)(f) is entailed by A. That is, one among ci 
and C 2 has its P-filler in A and its IV-filler in -> A. This can be verified by 
case analysis on y: in every model either A(y) or —>A(y) must be true. For 
models in which A(y) holds, C 2 is the P-filler of / satisfying the query; for 
models in which -*A(y) holds, c\ is. Observe that if ACE is used to express 
most specific concepts, the best approximation we can find for msc(A,f), 
by collecting assertions along the role paths starting from /, is the concept 
C = 3R.(3P.A n 3N) n 3R.(3P n 3N.->A), in which the fact that the same 
individual y is both the IV-filler of 3N and the P-filler of 3P is lost. Indeed, 
C is not subsumed by the query, as one can see by constructing an open 
tableau for C n —i3R.(3P.A n 3N.-^A){f). ■ 


The above example can be extended to a proof that deciding A \= C(a), 
where C is an A££-concept, is coNP-hard. Observe that this is a different 
source of complexity w.r.t. unsatisfiability in ACE. In fact, a concept C is 
unsatisfiable iff {C'(a)} |= _L(a). This problem is NP-complete when C is a 
concept in ACE (Subsection 3.3.1). 

The source coNP-complete problem is the complement of 2+2-sat, which 
is the following problem. 


Definition 3.40 (2+2-sat) Given a 4CNF propositional formula P, in 
which every clause has exactly two positive literals and two negative ones, 
decide whether P is satisfiable. 


The problem 2+2-sat is a simple variant of the well-known 3-SAT. Indeed, 
for 3-literal clauses mixing both positive and negative literals, add a fourth 
disjunct, constantly false; e.g., X V Y V —> Z is transformed into the 2+2- 
clause IVbV -i Z V -itrue. Unmixed clauses can be replaced by two mixed 
ones using a new variable (see [Schaerf, 1994a, Theorem 4.2.6]). 

Given an instance of 2+2-sat F = C\ A C *2 A • • • A C n . where each clause 
Ci = L\ + V LC V ~'P\_ V we construct an ABox Af as follows. Af 

has one individual l for each variable L in P, one individual c* for each clause 
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Ci, one individual / for the whole formula F, plus two individuals true and 
false for the corresponding propositional constants. 

The roles of Af are Cl (for Clause), Pi,P -2 (for positive literals), TVj , N 2 
(for negative literals), and the only concept name is A. Finally, Af is given 
by (we group role assertions on first individual to ease reading): 


Cl(f,c i) 


Cl(f,c n ) 


< P2(C1,11+) 

Ni(ci,l\_) 

(n 2 (c. iA_) 

| Aftrue), -■ A(false). 

' Pl(Cn,l?+) 

P2(C n ,q + ) 

' iVl(Cn,^-) 

N 2 (c n ,q_) 


Now let D be the following, fixed, query concept: 


D = 3Cl.((3P 1 .^A) n (3P 2 .^A) n (3Ni.A) n (3N 2 .A)). 

Intuitively, an individual name l is in the extension of A or —>A iff the propo- 
sitional variable L is assigned true or false, respectively. Then, checking 
whether Af |= D(f) corresponds to checking that in every truth assign- 
ment for F there exists a clause whose positive literals are interpreted as 
false, and whose negative literals are interpreted as true - i.e. , a clause that 
is not satisfied. If one applies the above idea to translate the two clauses 
(having just two literals each) false V -i Y, Y V -it rue, one obtains exactly 
the ABox of Example 3.39. 

The correctness of this reduction was proved by Schaerf [1993; 1994a]. We 
report here only the concluding lemma. 


Lemma 3.41 A 2+2-CNF formula F is unsatisfiable if and only if Af \= 

D(f)- 

Hence, instance checking in ACS, is coNP-hard. This implies that instance 
check in ACS cannot be efficiently solved by subsumption, unless coNP C np. 
We remark that only the size of Af depends on the source formula F . while 
D is fixed. Hence, instance checking in ACS is coNP-hard with respect to 
knowledge base complexity - and it is also NP-hard from Subsection 3.3.1. 
The upper bound for knowledge base complexity of instance checking in 
ACS is in n* but it is still not known whether the problem is n^-complete. 
Regarding combined complexity - that is, neither the size of the ABox nor 
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that of the query is fixed - in [Schaerf, 1994a; Donini et al, 1994b] it was 
proved that instance checking in ACS is PSPACE-complete. 

Since the above reduction makes use of negated concept names, it may 
seem that coNP-hardness arises from the interaction between qualified 
existential quantification and negated concept names. However, all that is 
needed are two concepts whose union covers all possible cases. We saw in 
Subsection 3.2.1 that 3R and MR.B have this property. Therefore, if we 
replace A and —<A in Af with 3R, and VR.B, respectively (where R is a new 
role name and B is a new concept name), we obtain a new reduction for 
which Lemma 3.41 still holds. Hence, instance checking in 3FC~ S (i.e. , ACS 
without negation of concept names) is coNP-hard too, thus confirming that 
coNP-hardness is originated by qualified existential quantification alone. In 
other words, intractability arises from a query language containing both 
qualified existential quantification, and pairs of concepts whose union is 
equivalent to T. Hence, for languages containing these constructs, the msc 
method is not effective. 

Regarding the expressivity of the language for assertions in the ABox, 
coNP-hardness of instance checking arises already when assertions in the 
ABox involve just concept and role names. However, note that a key point 
in the reduction is the fact that two individuals in the ABox can be linked 
via different role paths, as / and y were in Example 3.39. 


3.8 Discussion 

In this chapter we analyzed various lower bounds on the complexity of rea- 
soning about simple concept expressions in Description Logics. Our presen- 
tation appealed to the intuitive notions of exploring AND-OR trees, in the 
special case when the tree is derived from a tableau. 

We remark that an alternative approach to reasoning is to reduce it to 
the emptiness test for automata (e.g., [Vardi, 1996]), which has been quite 
successfully applied to temporal logics, and propositional logics of programs. 
However, till now such techniques have been used to obtain upper bounds 
in reasoning, while in order to obtain lower bounds one would need a way to 
reduce problems on automata to unsatisfiability/subsumption in Description 
Logic. The only example of this reduction is [Nebel, 1990b], for a very simple 
Description Logic, which we have omitted in this chapter for lack of space. 

We end the chapter with a perspective on the significance of the np, coNP, 
and P Space complexity lower bounds we have presented. Present reasoning 
systems in Description Logics (see Chapter 8) can now cope with reasonable 
size ExpTlME-complete problems. Hence the computational complexity of 
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the problems now reachable is above PSpace. However, in our opinion, for 
implemented systems the significance of a reduction lies not just in the the- 
oretical lower bound obtained, but also in the reduction itself. In fact, when 
experimenting with algorithms for subsumption, satisfiability, etc. [Baader 
et ai, 1992b; Hustadt and Schmidt, 1997] on an implemented system, one 
can exploit already known “hard” cases of a source problem like 3-SAT, 
2+2-sat, set splitting, or qbf validity to obtain “hard” instances for the 
algorithm under test. These instances isolate the influence of each source of 
combinatorial explosion on the performance of the overall reasoning system, 
and can be used to optimize reasoning algorithms in a piecewise fashion 
[Horrocks and Patel-Schneider, 1999], separately for the various sources of 
complexity. In this respect, the issue of finding “efficient” reductions (w.r.t. 
the size of the resulting concepts) is still open, and can make the difference 
when concepts to be tested scale up (see [Donini and Massacci, 2000]). 


3.9 A list of complexity results for subsumption 
and satisfiability 

Many names have been invented for languages of different Description Log- 
ics, e.g., TC for Frame Language, ACC for Attributive Descriptions Lan- 
guage with Complement, etc. Although suggestive, these names are not very 
explicit about which constructs are in the named language. This makes the 
huge mass of results about complexity of reasoning in Description Logics 
often difficult to screen by non-experts in the field. To clarify the constructs 
each language is equipped with, we use two lists of constructors: the first 
one for concept constructors, and the second one for role constructors. For 
example, the pair of lists (n, 3R, \/R.C ) (n, o) denotes a language whose con- 
cept constructors are conjunction n, unqualified existential quantification 
3R, and universal role quantification Vi?. C, and whose role constructors are 
conjunction n and composition o. Many combinations of concept construc- 
tors have been given a name which is now commonly used. For instance, 
the first list of the above example is known as J- C~ . In these cases, we fol- 
low a syntax first proposed in [Baader and Sattler, 1996b], and write just 
RC~ {C\, o) - that is, fFC~ augmented with role conjunction and composition 
- to make it immediately recognizable by researchers in the field. 


3.9.1 Notation 

In the following catalog, satisfiability and subsumption refer to the problems 
with plain concept expressions. When satisfiability and subsumption are 
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w.r.t. a set of axioms, we state it explicitly. Moreover, when the constructs 
of the Description Logic allow one to reduce subsumption between C and D 
to satisfiability of C n -> D, we mention only satisfiability. 

In the lists, we have tried to use the symbol of the DL construct when- 
ever possible. We have abbreviated some constructs, however: unqualified 
number restrictions ^ nR, ^ nR are denoted by ^ R, while qualified num- 
ber restrictions ~^nR.C, ^ nR.C are ^ R.C . When a construct is allowed 
only for names (either concept names in the first list, or role names in the 
second one) we apply the construct to the word name. 


3.9.2 Subsumption in PTime 

To the best of the author’s knowledge, no proof of PTiME-hardness has been 
given for any Description Logic so far. Therefore the following results refer 
only to membership in PTlME. 

• (n ,3R,\/R.C) () known as T Cr [Levesque and Braclnnan, 1987]. 

• (n ,3R,\/R.C,^(name)) () known as AC [Sclrmidt-SchauB and Smolka, 1991]. 

• (n ,3R,\/R.C,^ R) () known as ACAf [Donini et al. , 1997a]. 

• AC(o), AC(~) [Donini et al, 1999]. 

• FC~(n) [Donini et al., 1991a]. 

• (n , 3 R.C, {individual}) (fl, _ ) known as SCIRO 1 [Baader et al., 1998b], 


3.9.3 NP and coNP 

• (n ,3R.C,\/R.C,~>(name)) () (known as ACS) subsumption and unsatisfiability 
are NP-complete [Donini et al, 1992a] (see Subsection 3.3.1). 

• yfL(n), ACS(p), and (n,3 R.C,\/R.C) () (known as ACR, ACSTZ and T C~ S 
respectively) subsumption and unsatisfiability are NP-complete [Donini et al., 
1997a] (see Theorems 3.16, 3.17 for hardness, and [Donini et al., 1992a] for 
membership) . 

• (n, U, 3R, MR.C, -i(name)) () (known as ACU) subsumption and unsatisfiability 
are coNP-complete [Donini et al., 1997a] (see Subsection 3. 1.1.1). 

• ACAf{~) subsumption is coNP-complete, while satisfiability is decidable in poly- 
nomial time [Donini et al., 1999]. 

• !FC~( n, - ), !FC~{ n, o), and !FC~{ o,~ ) [Donini et al., 1999] (see Subsections 3.3.2, 
3.3.3, and 3.3.4). 

• ACO, satisfiability w.r.t. a set of acyclic axioms is coNP-hard [Buchheit et al., 
1994a; Calvanese, 1996b; Buchheit et al., 1998] (coNP-complete for AC£{) [Cal- 
vanese, 1996b]). 
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(n, U, -t, 3R.C, \/R.C) () (known as ACC) [Schmidt-SchauB and Smolka, 1991] (see 
Subsection 3.4.1). 

(n, -^{name) ,3R.C , \/R.C,ffR) () (known as ACE M) [Hemaspaandra, 1999]. 
FC~(R\c) (known as TC ), ACM(n), ACU{ n), (n ,3R.C,VR.C,*$^R) (n) 
(known as ACC MRS) [Donini et al., 1997a]. 

ACC(n,U,o) satisfiability [Massacci, 2001]. Membership is nontrivial. 

AC£() satisfiability w.r.t. a set of cyclic axioms is PSPACE-complete [Calvanese, 
1996b], 

ACM if) satisfiability w.r.t. a set of cyclic axioms of the form A = C, where each 
concept name A can appear only once on the left-hand side, is PSPACE-complete 
[Keisters, 1998]. 


3.9.5 ExpTime 

AC w.r.t. a set of axioms (see Section 3.5 for hardness). 

(n, U, -i, 3R.C, \/R.C) (l_l,o ,* ,id(),~) which includes ACC trans [Baader, 1991; 
Schild, 1991]. Membership is nontrivial, and was proved by Pratt [1979] without 
inverse, and by Vardi and Wolper [1986] for converse- PDL, reducing the problem 
to emptiness of tree automata. 

(n, U, 3R.C, VR.C, ^ name.C, ^ name~ .C) (U, o,* ,“ , idQ), known as 

ACCQJ re g (see Chapter 5). Membership is nontrivial. 

(n, U, 3R.C, \/R.C, px.C[x\, {individual}) (“), where px.C[x] denotes the least 
fixpoint of x [Sattler and Vardi, 2001]. Membership is nontrivial. 


3.9.6 NExpTime 

adding concrete domains (see [Baader and Hansclrke, 1991a]), satisfiability in 
ACC w.r.t. a set of acyclic axioms, and ACC{~) [Lutz, 2001a]. 

ACC(C\, U, '-i) satisfiability [Lutz and Sattler, 2001]. 

(n, U, 3R.C, \/R. C, {individual}, ^R.C) () satisfiability [Tobies, 2001b]. 

(n, U, -1, 3R.C, VR.C, <> R) (n) (known as ACCMR) satisfiability w.r.t. a set of 
axioms (only membership was proved) in [Buchheit et al., 1993a]). 


3.9.7 Undecidability results 

tFC~( o,=), which is a subset of the language of the knowledge representation 
system Kl-One [Schmidt-SchauB, 1989] (see Subsection 3.6.1 for undecidability 
of RC~(o,C) ). 

tFC~{ o,C,~ , functionality , R\c) 1 which is a subset of the language of the knowl- 
edge representation system NlKL [Patel-Schneider, 1989a]. 



148 


F. M. Donini 


• (),(n,o,-i) (known as U) [Schild, 1989]. 

• ACCJ\f( o,U,“ ), ACCAf{o,U) satisfiability w.r.t. a set of axioms [Baader and Sat- 
tler, 1999]. 
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Abstract 

In this chapter, we are concerned with the relationship between Description 
Logics and other formalisms, regardless of whether they were designed for 
knowledge representation issues or not. We concentrate on those representa- 
tion formalisms that either (1) had or have a strong influence on Description 
Logics (e.g., modal logics), (2) are closely related to Description Logics for 
historical reasons (e.g., semantic networks and structured inheritance net- 
works), or (3) have similar expressive power (e.g., semantic data models). 
There are far more knowledge representation formalisms than those men- 
tioned in this chapter. For example, “verb-centered” graphical formalisms 
like those introduced by Simmons [1973] are not mentioned since we believe 
that their relationship with Description Logics is too weak. 


4.1 AI knowledge representation formalisms 

In artificial intelligence (AI), various “non-logical” knowledge representation 
formalisms were developed, motivated by the belief that classical logic is in- 
adequate for knowledge representation in AI applications. This belief was 
mainly based upon cognitive experiments carried out with human beings and 
the wish to have representational formalisms that are close to the represen- 
tations in human brains. In this section, we discuss some of these formalisms, 
namely semantic networks, frame systems, and conceptual graphs. The first 
two formalisms are mainly presented for historical reasons since they can be 
regarded as ancestors of Description Logics. In contrast, the third formalism 
can be regarded as a “sibling” of Description Logics since both have similar 
ancestors and live in the same time. 
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- has skin 

Animal o ■ can move around 



4-1.1 Semantic networks 

Semantic networks originate in Quillian’s semantic memory models [Quil- 
lian, 1967], a graphical formalism designed to represent “word concepts” in 
a definitorial way, i.e. , similar to the one that can be found in an encyclo- 
pedia definition. This formalism is based on labeled graphs with different 
kinds of edges and nodes. Besides others, Quillian’s networks allow subclass- 
superclass edges, and and or edges, and subject-object edges between nodes. 

Following Quillian’s memory models, a great variety of semantic net- 
work formalisms were proposed; an overview of their history can be 
found in [Brachman, 1979]. In general, semantic networks distinguish 
between concepts (denoted by generic nodes) and individuals (denoted 
by individual nodes), and between subclass-superclass edges and prop- 
erty edges. Using subclass-superclass links, concepts can be organised 
in a specialization hierarchy. Using property edges, properties can be 
associated to concepts, that is, to the individuals belonging to the 
concept the properties are associated with. Figure 4.1 contains a hi- 
erarchy of animals, birds, fishes, etc. Interestingly, the cognitive ade- 
quacy of this approach was proven empirically [Collins and Quillian, 
1970]. 

The two kinds of edges interact with each other: a property is inherited 
along subclass-superclass edges - if not modified in a more specific class. 
For example, birds are equipped with skin because animals are equipped 
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with skin, and birds inherit this property because of the subclass-superclass 
edge between birds and animals. In contrast, although ostriches are birds, 
they do not inherit the property “can fly” from birds because this property 
is “modified” for ostriches. 

Intuitively, it should be possible to translate subclass-superclass edges 
into concept definitions, for example, 1 

Shark = Fish n CanBite n IsDangerous. 

According to Brachman [1985], the above translation is not always intended. 
Subclass-superclass edges can also be read as primitive concept definitions, 
that is, they impose only necessary properties but not sufficient ones. Hence 
the above translation might better be 

Shark C Fish n CanBite n IsDangerous. 

Due to the lack of a precise semantics, there are even more readings of 
subclasssuperclass edges, which are discussed in [1975; 1977b; 1985]. A 
prominent reading is that of inheritance by default , which can be specified 
in different ways, thus leading to misunderstandings and to the question 
which of these specifications is the “right” one (see also Chapter 6). 

As a consequence of this ambiguity, new formalisms mainly evolved along 
two lines: (1) To capture inheritance by default, various non- monotonic in- 
heritance systems, respectively various ways of reasoning in non-monotonic 
inheritance systems, were investigated [Touretzky et al, 1987; 1991; Sel- 
man and Levesque, 1993]. (2) To capture the monotonic aspects of seman- 
tic networks, a new graphical formalism, structured inheritance networks, 
was introduced and implemented in the system Kl-One [Brachman, 1979; 
Brachman and Schmolze, 1985]. It was designed to cover the declarative, 
monotonic aspects of semantic networks, and hence did not specify the way 
in which (non-monotonic multiple) inheritance was supposed to function in 
conflicting situations. Brachman and Schmolze [1985] argue that Kl-One 
does not allow cancellation or inheritance by default because such mecha- 
nisms would make taxonomies meaningless. Indeed, all properties of a given 
concept could be canceled, so that it would fit everywhere in the taxonomy. 
Their proposition is to make a strict separation of default assertions and 
conceptual descriptions. 

Brachman and Schmolze [1985], besides pointing out the computation of 
the taxonomy as a core system service, describe the meaning of various 
concept constructors that were implemented in Kl-One, for example con- 
junction, universal value restrictions, role hierarchies, role- value-maps, etc. 

1 In the following, we use standard Description Logics as defined in Chapter 2. 
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Moreover, we find a clear distinction between individuals and concepts, and 
between a terminological and an assertional formalism. 

Later [Levesque and Brachman, 1987], Kl-One was provided with a 
well-defined “Tarski-style” semantics which fixed the precise meaning of its 
graphical constructs and led to the definition of the first Description Logic 
[Levesque and Brachman, 1987], at that time also called terminological lan- 
guages, concept languages, or KL-ONE-based languages. Besides giving a 
precise meaning to semantic networks, this formalization allowed the investi- 
gation of inference algorithms with respect to their soundness, completeness, 
and computational complexity. For example, it turned out that subsumption 
in Kl-One is undecidable, mainly due to role-value-maps [Schmidt-Schaufi, 
1989]. 


4-1.2 Frame systems 

Minsky [l98l] introduced frame systems as an alternative to logic-oriented 
approaches to knowledge representation, which he thought were not ade- 
quate to “simulate common-sense thinking” for various reasons. His system 
provides record-like data structures to represent prototypical knowledge con- 
cerning situations and objects and includes defaults, multiple perspectives, 
and analogies. Nowadays, semantic networks and frame systems are often 
viewed as the same family of formalisms. However, in standard semantic 
networks, properties are restricted to primitive, atomic ones, whereas, in 
general, properties in frame systems can be complex concepts described by 
frames. 

One goal of the frame approach was to gather all relevant knowledge about 
a situation (e.g., entering a restaurant) in one object instead of distribut- 
ing this knowledge across various axioms. Roughly speaking, a situation (or 
an object) is described in one frame. A frame contains slots, similar to en- 
tries in a record, to represent properties of the situation described by the 
frame. Reasoning comes in two shapes: (1) Using a “partial matching”, more 
specific frames are embedded into more general ones, thus giving, for exam- 
ple, meaning to a new situation or classifying an object as a kind of, say, 
bird. (2) Searching for slot fillers to collect more information concerning 
a specific situation. A variety of expert systems [Fikes and Kehler, 1985; 
Christaller et al, 1992; Gen, 1995; Flex, 1999] are based on a frame-based 
formalism and are further enhanced with rules, triggers, daemons, etc. 

Despite the fact that frame systems were designed as an alternative to 
logic, the monotonic, declarative part of this formalism could be shown to 
be captured using first-order predicate logic [Hayes, 1977; 1979]. To our 
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knowledge, no precise semantics could be given for the non-declarative, non- 
logical, or non-monotonic aspects of frame systems. Hence neither their ex- 
pressive power nor the quality of the corresponding reasoning algorithms 
and services can be compared with other formalisms. 

In the remainder of this section, we show how the monotonic part of a 
frame-based knowledge base can be translated into an ACUM TBox [Cal- 
vanese et al. , 1994], 2 Since there is no standard syntax for frame systems, we 
have chosen to use basically the notation adopted by Fikes and Kehler [1985] , 
which is used also in the Kee 3 system. 

A frame definition is of the form Frame : F in KB F E, where F is a 
frame name and E is a frame expression , i.e., an expression formed according 
to the following syntax: 

E — > Superclasses : F\.. . . . 

MemberSlot : S\ 

ValueClass : H\ 

Cardinality.Min : rn \ 

Cardinality.Max : ri\ 

MemberSlot : Sk 
ValueClass : ///,. 

Cardinality.Min : rn^ 

Cardinality.Max : n*,. 

F.j denotes a frame name, Sj denotes a slot name, rrij and nj denote positive 
integers, and Hj denotes slot constraints. A slot constraint can be specified 
as follows: 

H — ► F | 

(INTERSECTION Hi H 2 ) \ 

(UNION Hi H 2 ) | 

(NOT H). 

A frame knowledge base F is a set of frame definitions. 

For example, Figure 4.2 shows a simple Kee knowledge base describing 
courses in a university. Cardinality restrictions are used to impose a mini- 
mum and maximum number of students that may be enrolled in a course, 
and to express that each course is taught by exactly one individual. The 
frame AdvCourse represents courses which enroll only graduate students, 

2 Not only the translation but also the example are by Calvanese et al. [1994]. 

3 Kee is a trademark of Intellicorp. Note that Kee users do not directly specify their knowledge 
base in this notation, but are allowed to define frames interactively via the graphical system 
interface. 
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Frame: Course in KB University 
MemberSlot: enrolls 
ValueClass: Student 
Cardinality.Min: 2 
Cardinality.Max: 30 
MemberSlot: taughtby 

ValueClass: (UNION GradStudent 
Professor) 

Cardinality.Min: 1 
Cardinality.Max: 1 

Frame: AdvCourse in KB University 
Superclasses: Course 
MemberSlot: enrolls 

ValueClass: (INTERSECTION 
GradStudent 
(NOT Undergrad)) 
Cardinality.Max: 20 


Frame: BasCourse in KB University 
Superclasses: Course 
MemberSlot: taughtby 
ValueClass: Professor 

Frame: Professor in KB University 

Frame: Student in KB University 

Frame: GradStudent in KB University 
Superclasses: Student 
MemberSlot: degree 
ValueClass: String 
Cardinality.Min: 1 
Cardinality.Max: 1 

Frame: Undergrad in KB University 
Superclasses: Student 


Fig. 4.2. A Kee knowledge base. 


i.e., students who already have a degree. Basic courses, on the other hand, 
may be taught only by professors. 

Hayes [1979] gives a semantics to frame definitions by translating them to 
first-order formulae in which frame names are translated to unary predicates, 
and slots are translated to binary predicates. 

In order to translate frame knowledge bases to AOJ.H knowledge bases, 
we first define the function U/ that maps each frame expression into an 
ACUM concept expression as follows: Each frame name F is mapped onto 
an atomic concept 'h(.F), each slot name S onto an atomic role (S') , and 
each slot constraint H onto the corresponding Boolean combination vH( H ) 
of concepts. Then, every frame expression of the form 


Superclasses : F\, ... , A), 
MemberSlot : S± 
ValueClass : Hi 
Cardinality.Min : mi 
Cardinality.Max : n\ 

MemberSlot : Sj~ 
ValueClass : Hj. 
Cardinality.Min : mk 
Cardinality.Max : 
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Course C Venrolls. Student n ^ 2 enrolls n ^ 30 enrolls n 

Vtaughtby.( Professor U GradStudent) n = 1 taughtby 
AdvCourse C Course n Venrolls. (GradStudent n -lUndergrad) n ^ 20 enrolls 
BasCourse C Course n Vtaughtby. Professor 
GradStudent C Student n Vdegree. String n = 1 degree 
Undergrad C Student 

Fig. 4.3. The ALUM knowledge base corresponding to the Kee knowledge base in 
Figure 4.2. 

is mapped into the concept 

'L(Fi) n • • • n 'S'(Fh) n 

n>mi tf(Si) n<m^(5i) n 

V* (S fc ).tf n > m k (S fc ) n < n fc (S fc ). 

Making use of the mapping \F, we obtain the ALUM knowledge base 
'h(^ r ) corresponding to a frame knowledge base F, by introducing in 
^(F) an inclusion assertion 'S’(F) C 'L(F) for each frame definition 

Frame : F in KB F E in F . 

The ALUM knowledge base corresponding to the Kee knowledge base 
given in Figure 4.2 is shown in Figure 4.3. 

The correctness of the translation follows from the correspondence be- 
tween the set-theoretic semantics of ALUM and the first-order interpretation 
of frames [Hayes, 1979; Borgida, 1996; Donini et al., 1996b]. Consequently: 

• verifying whether a frame F is satisfiable in a knowledge base and 

• identifying which of the frames are more general than a given frame 

are captured by concept satisfiability and concept subsumption in ALUM 
knowledge bases. Hence reasoning for the monotonic, declarative part of 
frame systems can be reduced to concept satisfiability and concept sub- 
sumption in ALUM knowledge bases. 


4-1.3 Conceptual graphs 

Besides Description Logics, conceptual graphs [Sowa, 1984] can be viewed 
as descendants of frame systems and semantic networks. Conceptual graphs 
(CGs) are a rather popular (especially in natural language processing) 
and expressive formalism for representing knowledge about an application 
domain in a graphical way. They are given a formal semantics, e.g., by 
translating them into (first-order) formulae. 

In the CG formalism, one is, just as for Description Logics, not only 
interested in representing knowledge, but also in reasoning about it. 
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Reasoning services for CGs are, for example, deciding whether a given graph 
is valid, i.e., whether the corresponding formula is valid, or whether a graph 
g is subsumed by a graph h, i.e., whether the formula corresponding to 
g implies the formula corresponding to h. Since CGs can express all of 
first-order predicate logic [Sowa, 1984], these reasoning problems are un- 
decidable for general CGs. In the literature [Sowa, 1984; Wermelinger, 1995; 
Kerdiles and Salvat, 1997] one can find complete calculi for validity of CGs, 
but implementations of these calculi may not terminate for formulae that 
are not valid. An approach to overcoming this problem, which has also been 
employed in the area of Description Logics, is to identify decidable fragments 
of the formalism. The most prominent decidable fragment of CGs is the class 
of simple conceptual graphs (SGs) [Sowa, 1984], which corresponds to the 
conjunctive, positive, and existential fragment of first-order predicate logic 
(i.e., existentially quantified conjunctions of atoms). Even for this simple 
fragment, however, subsumption is still an NP-complete problem [Chein and 
Mugnier, 1992].' 4 

Although Description Logics and CGs are employed in very similar appli- 
cations, precise comparisons were published, to our knowledge, only recently 
[Coupey and Faron, 1998; Baader et al, 1999c]. These comparisons are based 
on translations of CGs and DL concepts into first-order formulae. It turned 
out that the two formalisms are quite different for several reasons: 

1. CGs are translated into closed first-order formulae, whereas DL concepts are 
translated into formulae in one free variable; 

2. since Description Logics use a variable- free syntax, certain identifications of 
variables expressed by cycles in SGs and by co-reference links in CGs cannot 
be expressed in Description Logics; 

3. in contrast to CGs, most Description Logics considered in the literature only 
allow unary and binary relations and not relations of arity greater than 2; 

4. SGs are interpreted by existential sentences, whereas almost all Description 
Logics considered in the literature allow for universal quantification. 

Possibly as a consequence of these differences, so far no natural fragment 
of CGs that corresponds to a Description Logic has been identified. In the 
sequel, we will illustrate the main aspects of the correspondence result pre- 
sented by Baader et al. [1999c], which strictly extends the one proposed by 
Coupey and Faron [1998]. 


4 Since SGs are equivalent to conjunctive queries (see also Chapter 16), the NP-completeness of 
subsumption of SGs is also an immediate consequence of NP-completeness of containment of 
conjunctive queries [Chandra and Merlin, 1977]. 
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concept types: relation types: individual marker: 



Male Female Student 

Fig. 4.4. An example of a support. 


Simple conceptual graphs 

Simple conceptual graphs (SGs) as introduced by Sowa [1984] are the most 
prominent decidable fragment of CGs. They are defined with respect to a so- 
called support. Roughly speaking, the support is a partially ordered signature 
that can be used to fix the a primitive ontology of a given application domain. 
It introduces a set of concept types (unary predicates), a set of relation 
types (n-ary predicates), and a set of individual markers (constants). As an 
example, consider the support S shown in Figure 4.4, where T is the most 
general concept type representing the entire domain. The partial ordering 
on the individual markers is flat, i.e. , all individual markers are pairwise 
incomparable and the so-called generic marker * is more general than all 
individual markers. In this example, all relation types are assumed to have 
arity 2 and to be pairwise incomparable except for hasDescendants, which is 
more general than hasChild. The partial orderings on the types yield a fixed 
specialization hierarchy for these types that must be taken into account 
when computing subsumption relations between SGs. For binary relation 
types, this partial ordering resembles a role hierarchy in Description Logics. 

An SG over the support S is a labeled bipartite graph of the form g = 
(C, R, E,£), where C is a set of concept nodes , R is a set of relation nodes, 
E C C x R is the edge relation L, and l is a labeling of concept nodes. 

As an example, consider the SGs depicted in Figure 4.5: the SG g describes 
a woman Mary having a child who likes its grandfather Peter and who 
attends the computer science course number KR101; the node do in the SG 
h describes all mothers having a child who likes one of its grandparents. 

Each concept node is labeled by i with a concept type (such as Female) 
and a referent, i.e., an individual marker (such as MARY) or the generic 
marker *. A concept node is called generic if its referent is the generic 
marker; otherwise, it is called an individual concept node. Each relation node 
is labeled with a relation type r (such as hasChild), and its outgoing edges 
are labeled with indices according to the arity of r. For example, for the 
binary relation hasChild, there is one edge labeled with 1 (leading to the 
parent), and one edge labeled with 2 (leading to the child). 
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Fig. 4.5. Two simple graphs. 


Simple graphs are given a formal semantics in first-order predicate logic 
(FOL) by the operator $ [Sowa, 1984]: each generic concept node is related 
to a unique variable, and each individual concept node is related to its indi- 
vidual marker. Concept types and relation types are translated into atomic 
formulae, and the whole SG g is translated into the existentially closed con- 
junction of all atoms obtained from the nodes in g. 

In our example, this operator yields 

$(<7) = 3xi.(Female(MARY) A Human(PETER) A Student(xi) A 
CScourse(KRlOl) A hasChild(PETER, MARY) A 
hasChild(MARY, x\) A likes(aq, PETER) A attends(.xi, KR101)), 

4 >(/i) = 3xoxi.T2.(Female(xo) A Fluman(a:i) A Pluman(x 2 ) A 

hasChild(xi, xo) A hasChild(xo, £2) A Iikes(x 2 , aq)), 

where x\ in $( 5 ) is (resp. xq, x\ , and X2 in $(/i) are) introduced for the 
generic concept node C 2 (resp. the generic concept nodes do, d\, and da). 

In general, there are three different ways of expressing conjunction of con- 
cept types. For example, suppose we want to express that Mary is both 
female and a student. This can be expressed by an SG containing one in- 
dividual concept node for each statement (see Figure 4. 6(1 )). 5 A second 
possibility is to introduce a new concept type in the support for a common 
specialization of Female(MARY) and Student(MARY) (see Figure 4.6(2)). Fi- 
nally, such a conjunction can be represented by labeling the corresponding 
concept node with a set of concept types instead of a single concept type 
(see Figure 4.6(3); for details on how to handle SGs labeled with sets of 
concept types see [Baader et al, 1999c]). 

5 Note that this solution could not be applied if the individual marker MARY were replaced by 
the generic marker *, because the two resulting generic concept nodes would be interpreted by 
different variables. 
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(1) (2) T (3) 



Female: MARY 


FemStud 



Student: MARY 

do 

FemStud: MARY 

e 0 

{Female, Student}: MARY 


Fig. 4.6. Expressing conjunction of concept types in SGs. 


Subsumption with respect to a support S for two SGs g, h is defined by a 
so-called projection from htog [Sowa, 1984; Chein and Mugnier, 1992]: g is 
subsumed by h w.r.t. S iff there exists a mapping from h to g that (1) maps 
concept nodes (resp. relation nodes) in h onto more specific (w.r.t. the partial 
ordering in S) concept nodes (resp. relation nodes) in g and that (2) pre- 
serves adjacency. 

In our example (Figure 4.5), it is easy to see that g is subsumed by h, 
since mapping di onto c* for 0 < i < 2 yields a projection w.r.t. S from h to 
9- 

Subsumption for SGs is an NP-complete problem [Chein and Mugnier, 
1992], In the restricted case where the subsumer h is a tree, subsumption 
can be decided in polynomial time [Mugnier and Chein, 1992]. 

Concept descriptions and simple graphs 
In order to determine a Description Logic corresponding to (a fragment of) 
SGs, one must take into account the differences between Description Logics 
and CGs mentioned before. 

• Most Description Logics only allow role terms corresponding to binary rela- 
tions and concept descriptions describing connected structures. Thus, Baader 
et al. [1999c] and Coupey and Faron [1998] restrict their attention to connected 
SGs over a support S containing only unary and binary relation types. 

• Due to the different semantics of SGs and concept descriptions (closed formulae 
vs. formulae in one free variable), Coupey and Faron restrict their attention to 
SGs that are trees. Baader et al. introduce so-called rooted SGs, i.e., SGs that 
have one distinguished node called the root. An adaption of the operator 4> yields 
a translation of a rooted SG g into an FO formula 4>(</)(a;o) with one free variable 
x 0 - 

• Since all Description Logics considered in the literature allow conjunction of con- 
cepts, Baader et al. allow concept nodes labeled with a set of concept types 
instead of a single concept type in order to express conjunction of atomic 
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ci : Human, {PETER}c 2 : Male, Student 


attends 

I 

C3 : CScourse 



Fig. 4.7. Translating concept descriptions into simple graphs. 

concepts in SGs. Coupey and Faron avoid the problem of expressing conjunction 
of atomic concepts: they just do not allow (1) conjunctions of atomic concepts in 
concept descriptions, and (2) individual concept nodes in SGs. 

The Description Logic considered by Baader et al, denoted by SCTTZO 1 , 
allows existential restrictions and intersection of concept descriptions (SC), 
inverse roles (X), intersection of roles (1Z), and unary one- of concepts {Of). 
For the constants occurring in the one-of concepts the unique name assump- 
tion applies, i.e. , all constants are interpreted as different objects. Coupey 
and Faron only consider a fragment of the Description Logic SCI. 

In both papers, the correspondence result is based on translating concept 
descriptions into syntax trees. For example, consider the SCTTZO i-concept 

C = Female n BhasChild - . (Human n {PETER}) n 

3(hasChild n I ikes). (Male n Student n Eiattends. CScourse) 

describing all daughters of Peter who have a fond child that is a student 
attending a computer science course. The syntax tree corresponding to C is 
depicted on the left-hand side of Figure 4.7. 

One can show [Baader et al, 1999c] that, if concept descriptions C are 
restricted to contain at most one unary one-of concept in each conjunc- 
tion, the corresponding syntax tree Tc can be easily translated into an 
equivalent rooted SG gc that is a tree 6 (see Figure 4.7). Conversely, ev- 
ery rooted SG g that is a tree and that contains only binary relation 
types can be translated into an equivalent SCXlZO \- concept description 
C g . There are, however, rooted SGs that can be translated into equiva- 
lent SCTTZO i-concept descriptions though they are not trees. For exam- 
ple, the rooted SG g depicted in Figure 4.5 is equivalent to the concept 


In this context, a tree may contain more than one relation between two adjacent concept nodes. 
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description 

C g = {MARY} FI Female n3hasChilcT. (Human n {PETER}) n 

BhasChild. (Student n 3attends.({KR101} n CScourse) n 3likes. {PETER}). 

In general, the above correspondence result can be strengthened as follows 
[Baader et al, 1999c]: every rooted SG g containing only binary relation 
types can be transformed into an equivalent rooted SG that is a tree if 
each cycle in g with more than two concept nodes contains at least one 
individual concept node. Hence, each such rooted SG can be translated into 
an equivalent ££Z7?.0i-concept description. 

Note that the SG h with root do in Figure 4.5 cannot be translated into 
an equivalent SCXlZO \ -concept description Ch because, in SCITZOi, one 
cannot express that the grandparent (represented by the concept node d\) 
and the human liked by the child (represented by the concept node cfo) nrust 
be the same person. 

The correspondence result between SCllZOi and rooted SGs allows the 
tractability result for subsumption between SGs that are trees to be trans- 
ferred to ECTlZO\ . Furthermore, the characterization of subsumption based 
on projections between graphs was adapted to SCTTZOi and other Descrip- 
tion Logics, e.g., ACS, and is used in the context of inference problems like 
matching and computing least common subsumers [Baader and Kristers, 
1999; Baader et al., 1999b]. Conversely, the correspondence result can be 
used as a basis for determining more expressive fragments of conceptual 
graphs, for which validity and subsumption are decidable. Based on an ap- 
propriate characterization of a fragment of conceptual graphs corresponding 
to a more expressive Description Logic (like ACC), one could use algorithms 
for these Description Logics to decide validity or subsumption of graphs in 
this fragment. 


4.2 Logical formalisms 

In this section, we will investigate the relationship between Description Log- 
ics and other logical formalisms. 

Traditionally, the semantics of Description Logics is given in a Tarski- 
style model-theoretic way. Alternatively, it can be given by a translation 
into predicate logic, where it depends on the Description Logic whether 
this translation yields first-order formulae or whether it goes beyond first- 
order, as is the case for Description Logics that allow, e.g., the transitive 
closure of roles or fixpoints. Due to the variable-free syntax of Description 
Logics and the fact that concepts denote sets of individuals, the translation 
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of concepts yields formulae in one free variable. Following the definition 
by Borgida [1996], a concept C and its translation n(C)(x) are said to be 
equivalent, if and only if, for all interpretations 7 X = (A- 7 , - x ) and all a G A x , 
we have 

a G C x iff X j= ir(C)(a). 

A Description Logic VC is said to be less expressive than a logic C if there 
is a translation that translates all XX-concepts into equivalent C formulae. 
Such a translation is called preserving. 

Note that there are various other ways in which equivalence of formu- 
lae and logics being “less expressive than” others could have been defined 
[Baader, 1996a; Kurtonina and de Rijke, 1997; Areces and de Rijke, 1998]. 
For example, a less strict definition is the one that only asks the translation 
to preserve satisfiability. 

To start with, we give a translation n that translates AXC-concepts into 
predicate logic and which will be useful in the remainder of this section. 
For those familiar with modal logics, note that this translation parallels the 
one from propositional modal logic [van Benthem, 1983; 1984]; the close 
relationship between modal logic and Description Logic will be discussed in 
Subsection 4.2.2. For ACC , the translation of concepts into predicate logic 
formulae can be defined in such a way that the resulting formulae involve 
only two variables, say x, y. and only unary and binary predicates. In the 
following, C k denotes the first-order predicate logic over unary and binary 
predicates with k variables. 

The translation is given by two mappings ir x and ir y from AXC-concepts 
into C 2 formulae in one free variable. Each concept name A is also viewed 
as a unary predicate symbol, and each role name R is viewed as a binary 
predicate symbol. For AXC-concepts, the translation is inductively defined 
as follows: 

-Kx(A) = A(x) n y (A) = A{y) 

tt x {CC\D) = n x (C) A ir x (D) TTy(CnD) = ir y (C) A n y (D) 

TT X (C U D) = TT X (C) V 7 T x (D) TT y (C U D) = TT y (C ) V 7 T y (D) 

ir x (3R.C ) = 3y.R(x, y) A ir y {C) 7 r y (3R.C) = 3x.R(y, x) A tt x (C) 

n x (VR.C) = \/y.R(x,y) D ir y (C) 7 T y (VR.C) = Vx.R(y,x) D n x (C). 

Other concept and role constructors that can easily be translated into first- 
order predicate logic without involving more than two variables are inverse 
roles, conjunction, disjunction, and negation on roles, and one-of. 8 


7 In the following, we view interpretations both as DL and predicate logic interpretations. 

8 In this case, the translation is to C 2 with constants. 
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If a Description Logic allows number restrictions ^ nR , R, the trans- 
lation involves either counting quantifiers 3- n , 3- (and still involves only 
two variables) or equality (and involves an unbounded number of variables): 

n x (^ n R) = 3- n y.R(x, y) = 3y 1 ,..., y n . /\ ijLj y % yj A /\ t R(x, y, : ) 
tt x (^ n R) = 3^ n y.R(x, y) = Vyi, . . . , y n+1 . /\ i¥=j yi yj D Vi ~^R(x, yfi. 

For qualified number restrictions, the translations can easily be modified 
with the same effect on the number of variables involved. 

So far, all Description Logics have been less expressive than first-order 
predicate logic (possibly with equality or counting quantifiers). In contrast, 
the expressive power of a Description Logic including the transitive closure 
of roles goes beyond first-order logic: First, it is easy to see that expressing 
transitivity ( p + (x , y) A p + (y, z )) D p + (x, z ) involves at least three variables. 
To express that a relation p + is the transitive closure of p, we first need 
to enforce that p + is a transitive relation including p - which can easily be 
axiomatized in first-order predicate logic. Secondly, we must enforce that p + 
is the smallest transitive relation including p - which, as a consequence of 
the Compactness Theorem, cannot be expressed in first-order logic. 


Internalization of Knowledge Bases: So far, we have been concerned 
with preserving translations of concepts into logical formulae, and thus could 
reduce satisfiability of concepts to satisfiability of formulae in the target 
logic. In Description Logics, however, we are also concerned with concept 
consistency and logical implication w.r.t. a TBox, and with ABox consis- 
tency w.r.t. a TBox. 

Furthermore, TBoxes differ in whether they are restricted to be acyclic, 
allow cyclic definitions, or allow general concept inclusion axioms (see Chap- 
ter 2 for details). In first-order logic, the equivalent to a TBox asser- 
tion is simply a universally quantified formula, and thus it is not neces- 
sary to make the above-mentioned distinction between, for example, pure 
concept satisfiability and satisfiability with respect to a TBox - provided 
that cyclic TBoxes are read with descriptive semantics [Baader, 1990a; 
Nebel, 1991] (cyclic TBoxes read with least or greatest fixpoint semantics go 
beyond the expressive power of first-order predicate logic). In the following, 
we consider only the most expressive form of TBoxes, namely those allowing 
general concept inclusion axioms. Given a preserving translation ir from DL 
concepts into first-order formulae and a TBox T = {Ci C D* | 1 < i < n}, 
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we define 

n 

vr(T) = Mx. f \{ir x (Ci ) D 7r x (A))- 

i=l 

Then it is easy to show that: 

• A concept C is satisfiable with respect to T iff the formula 7 t x (C) A 7t(T) is 
satisfiable. 

• A concept C is subsumed by a concept D with respect to T iff the formula 
n x (C) A ^ir x (D) A 7t(T) is unsatisfiable. 

• Given two index sets /, J, an ABox {Rk{ai, aj) | ( i,j , k) £ 1} U {Cy(aj) | (i,j) € 
J} is consistent with T iff the formula 

f\ Rk(a z ,aj) A f\ Tr x (Cj)(ai) A n(T) 

(i,j,k)el 

is satisfiable, where the a^-s in the formula are constants corresponding to the 
individuals in the ABox. 

Observe that, if all concepts in a TBox T can be translated to L '? 
(resp. C 2 ), then the translation 7r(T) of T is also a formula of C 2 (resp. C 2 ). 

Hence in first-order logic, reasoning with respect to a knowledge base 
(consisting of a TBox and possibly an ABox) is not more complex than 
reasoning about concept expressions alone - in contrast to the complexity 
of reasoning for most Description Logics, where considering even acyclic 
TBoxes can make a considerable difference (for example, see [Calvanese, 
1996b; Lutz, 1999a]). This gap is not surprising since first-order predicate 
logic is far more complex than most Description Logics, namely undecidable. 

In the following, we investigate logics that are more closely related to 
Description Logics, namely restricted variable fragments, modal logics, and 
guarded fragments. 


4-2.1 Restricted variable fragments 

One way to define decidable fragments of first-order logic is to restrict the set 
of variables which are allowed inside formulae and the arity of relation sym- 
bols. As mentioned in the previous section, we use C k to denote first-order 
predicate logic over unary and binary predicates with at most k variables. 
Analogously, C k denotes first-order predicate logic over unary and binary 
predicates with at most k variables and counting quantifiers 3- n , 3- n . 

With the exception of the Description Logics introduced by Calvanese 
et al. [1998a] and Lutz et al. [1999], the translation of DL concepts into 
predicate logic formulae involves predicates of arity at most 2. 
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From the translations in the previous section, it follows immediately that: 

• ACC1Z is less expressive than C 2 and that 

• ACCATlZ is less expressive than C 2 . 

As we have shown above, general TBox assertions can be translated into 
C? formulae. These facts together with the linearity of the translation yield 
upper bounds for the complexity of ACC1Z and ACCAflZ (even though these 
bounds are far from being tight): C 2 and C 2 are known to be NExpTime- 
complete [Gradel et al, 1997a; Pacholski et al, 2000] (for C 2 , this is true 
only if numbers in counting quantifiers are assumed to be coded in unary, 
an assumption often made in Description Logics); hence satisfiability and 
subsumption with respect to a (possibly cyclic) TBox are in NExpTime for 
ACC1Z and ACCAflZ. 

However, both C? and C 2 are far more expressive than ACC1Z and 
ACCAfIZ, respectively. For example, both logics allow the negation of binary 
predicates, i.e. , subformulae of the form -■ R(x, y ). In Description Logics, this 
corresponds to negation of roles, an operator that is rarely considered in De- 
scription Logics, except in the weakened form of difference 9 [De Giacomo, 
1995; Calvanese et al. , 1998a] (exceptions are the work by Mameide and 
Montero [1993] and Lutz and Sattler [2000b], which deal with genuine nega- 
tion of roles). Moreover, C? and C 2 allow “global” quantification, i.e., formu- 
lae of the form 3x.<f>(x) or Vx.'L(a:) that talk about the whole interpretation 
domain. In contrast, quantification in Description Logics is, in general, “lo- 
cal”, e.g., concepts of the form MR.C only constrain all R-successors of an 
individual. 

Borgida [1996] presents a variety of results stating that a certain Descrip- 
tion Logic is less expressive than, or as expressive as, a certain fragment of 
first-order logic. We mention only the most important ones: 

• ACC extended with 

(role constructors) full Boolean operators on roles, inverse roles, cross-product 
of two concepts, an identity role id, and 
(concept constructors) individuals (“one-of”), 

is as expressive as C 2 (and therefore decidable and, more precisely, NExpTime- 
complete) . 

• A further extension of this logic with all sorts of role- value-maps is as expressive 
as C 3 (and therefore undecidable) . 


9 Difference of roles is easier to deal with than genuine negation, since it does not destroy 
“locality” of quantification. 
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Since both extensions include full Boolean operators on roles, they can sim- 
ulate a universal role using the complex role R U —>R, and thus general TBox 
assertions can be internalized (see Chapter 5). Thus, for these two exten- 
sions, reasoning with respect to (possibly cyclic) TBoxes can be reduced to 
pure concept reasoning - i.e., the TBox can be internalized - and the above 
complexity results include both sorts of reasoning problems. 

Later, a second Description Logic was presented that is as expressive as 
C 2 [Lutz et al . , 2001a]. In contrast to the logic in [Borgida, 1996], this logic 
does not allow a role to be built as the cross-product of two concepts, and 
it does not provide individuals. However, using the identity role id (with 
id 1 = {(x, x) | x G A 1 } for all interpretations X), we can guarantee that (the 
atomic concept) N is interpreted as an individual, i.e., a singleton set, using 
the following TBox axiom: 

T C 3(R LI -lR).(IV n V-ud.-dV). 


4-2.2 Modal logics 

Modal logics and Description Logics have a very close relationship, which 
was first described in [Schild, 1991]. In a nutshell, Schild [1991] points out 
that ACC can be seen as a notational variant of the multi-modal logic 
K m . Later, a similar relationship was observed between more expressive 
modal logics and Description Logics [De Giacomo and Lenzerini, 1994a; 
Schild, 1994], namely between (extensions of) Propositional Dynamic Logic 
pdl and (extensions of) ACC reg , i.e., ACC extended with regular roles. Fol- 
lowing and exploiting these observations, various (complexity) results for 
Description Logics were found by translating results from modal or propo- 
sitional dynamic logics and the //-calculus to Description Logics [De Gi- 
acomo and Lenzerini, 1994a; 1994b; Schild, 1994; De Giacomo, 1995]. 
Moreover, upper bounds for the complexity of satisfiability problems were 
tightened considerably, mostly in parallel with the development of deci- 
sion procedures suitable for implementations and optimization techniques 
for these procedures [De Giacomo and Lenzerini, 1995; De Giacomo, 1995; 
Horrocks et al., 1999]. In the following, we will describe the relation between 
modal logics and Description Logics in more detail. 

We start by introducing the basic modal logic K; for a nice introduction 
and overview see [Halpern and Moses, 1992; Blackburn et al., 2001]. Given 
a set of propositional letters pi,P 2 ,- ■ ■ , the set of formulae of the modal logic 
K is the smallest set that: 


• contains pi,P 2 , ■ ■ ■ , 
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• is closed under Boolean connectives A, V, and -i, and 

• if it contains /, then it also contains d<f> and Otp. 

The semantics of modal formulae is given by so-called Kripke structures 
M = (S, 7r,/C), where S is a set of so-called states or worlds (which corre- 
spond to individuals in Description Logics), ir is a mapping from the set 
of propositional letters into sets of states (i.e., 7r (pi) is the set of states in 
which pi holds), and /C is a binary relation on the states S, the so-called ac- 
cessibility relation (which can be seen as the interpretation of a single role). 
The semantics is then given as follows, where, for a modal formula 4> and a 
state s£ 5, the expression M, s |= (j) is read as “</> holds in M in state s” . 

M, s\= pi iff s G 7T (p^ 

M, s \= (j)\ A 4*2 iff M, s j= (j>i and M, s \= fa 

M , s \= 4>iW <j> 2 iff M, s |= 4>i or M , s \= 

M, s 1= - «f> iff M, s (j) 

M , s \= Ocj> iff there exists s' G S with (s, s') G tC and M, s' |= <f> 

M , s \= □( f> iff for all s' G S, if (s, s') G /C, then M, s' \= <j>. 

In contrast to many other modal logics, K does not impose any restric- 
tions on the Kripke structures. For example, the modal logic S4 is obtained 
from K by restricting the Kripke structures to those where the accessibility 
relation K. is reflexive and transitive. Other modal logics restrict /C to be 
symmetric, well-founded, an equivalence relation, etc. Moreover, the num- 
ber of accessibility relations may be different from one. Then we are talking 
about multi-modal logics, where each accessibility relation /Q can be thought 
to correspond to one agent, and is quantified using the multi-modal opera- 
tors dj and Oj (or, alternatively [i] and (i)). For example, K m stands for 
the multi-modal logic K with m agents. 

To establish the correspondence between the modal logic K m and the 
Description Logic ACC, Schild [1991] gave the following translation / from 
„4£C-concepts using role names R\, ... , R m to K m : 

f(A) = A 

f(CnD) = /(C) A /(D) 

/(CUD) = /(C) V /(D) 
fh(C)) = -(/(C)) 
f(\/Ri.C) = □ i (/(C)) 
f(3Ri.C) = O i(f(C)). 

Now, Kripke structures can easily be viewed as DL interpretations and vice 
versa. Then, from the semantics of K m and ACC, it follows immediately that 
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a is an instance of an „4£C-concept C in an interpretation X iff its translation 
f{C ) holds in a in the Kripke structure corresponding to X. Obviously, we 
can define an analogous translation from K m formulae into ACC. 

There exists a large variety of modal logics for a variety of applications. 
In the following, we will sketch some of them together with their relation to 
Description Logics. 

Propositional Dynamic Logics are designed for reasoning about the be- 
havior of programs. Propositional Dynamic Logic (pdl) was introduced by 
Fischer and Ladner [1979], and proven to have an ExpTiME-complete sat- 
isfiability problem by Fischer and Ladner [1979] and Pratt [1979]; for an 
overview, see [Harel et al, 2000]. pdl was designed to describe the (dynamic) 
behavior of programs: complex programs can be built starting from atomic 
programs by using non-deterministic choice (U), composition (;), and itera- 
tion (•*). pdl formulae can be used to describe the properties that should 
hold in a state after the execution of a complex program. For example, the 
following pdl formula holds in a state if the following condition is satisfied: 
whenever program a or (3 is executed, a state is reached where p holds, and 
there is a sequence of alternating executions of a and (3 such that a state is 
reached where —>p A q holds: 

[a U (3\p A ((a; j3)*)(pp A q). 

Its DL counterpart, ACC reg , was introduced independently by Baader [1991]. 
ACC reg is the extension of ACC with regular expressions over roles 10 and 
can be seen as a notational variant of Propositional Dynamic Logic. For 
this correspondence, see the work by Schild [1991] and De Giacomo and 
Lenzerini [1994a], and Chapter 5. There exist a variety of extensions of pdl 
(or ACC reg ), for example with inverse roles, counting, or difference of roles, 
most of which still have an ExpTime satisfiability problem; see, e.g., [Kozen 
and Tiuryn, 1990; De Giacomo, 1995; De Giacomo and Lenzerini, 1996] and 
Chapter 5. 

The /i-calculus can be viewed as a generalization of dynamic logic, with 
similar applications, and was introduced by Pratt [1981] and Kozen [1983]. 
It is obtained from multi-modal K m by allowing (least and greated) fix- 
point operators to be used on propositional letters. For example, for // 
the least fixpoint operator and X a variable for propositional letters, 

10 Regular expressions over roles are built using union (U) , composition (o), and the Kleene 
operator (•*) on roles and can be used in ACCreg-concepts in the place of atomic roles (see 
Chapter 5). 
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the formula jiX.p V ( a)X describes the states with a (possibly empty) 
chain of a edges into a state in which p holds. In PDL, this formula 
is written (a*)p, and its ACC reg counterpart is 3R* a .p. However, the //- 
calculus is strictly more expressive than PDL or ACC reg : for example, 
the //-calculus can express well-foundedness of a program (binary rela- 
tion), i.e. , there is a //-calculus formula that has only models in which 
a is interpreted as a well-founded relation (that is, a relation with- 
out any infinite chains). In [De Giacomo and Lenzerini, 1994b; 1997; 
Calvanese et al., 1999c], this additional expressive power is shown to be 
useful in a variety of DL applications. The DL counterpart of the //-calculus 
extended with number restrictions [De Giacomo and Lenzerini, 1994b; 
1997] and additionally with inverse roles [Calvanese et al., 1999c] is proven 
to have an ExpTiME-complete satisfiability problem. 

There are two other classes of Description Logics with other forms of 
fixpoints: in Description Logics, fixpoints first came in through (1) the tran- 
sitive closure operator [Baader, 1991], which is naturally defined using a 
least fixpoint, and (2) terminological cycles [Baader, 1990a], which have a 
different meaning according to whether a greatest, least, or arbitrary fixpoint 
semantics is employed [Nebel, 1991; Baader, 1996b; Kiisters, 1998]. 


Temporal logics are designed for reasoning about time-dependent infor- 
mation. They have applications in databases, automated verification of pro- 
grams, hardware, distributed systems, natural language processing, plan- 
ning, etc. and come in various shapes; for a survey of temporal logics, see, 
e.g., [Gabbay et al, 1994], Firstly, they can differ in whether the basic tempo- 
ral entities are time points or time intervals. Secondly, they differ in whether 
they are based on a linear or on a branching temporal structure. In the latter 
structures, the flow of time might “branch” into various succeeding future 
times. Finally, they differ in the underlying logic (e.g., Boolean logic or first- 
order predicate logic) and in the operators provided to speak about the past 
and the future (e.g., operators that refer to the next time point, to all fu- 
ture time points, to a future time point and all its respective future time 
points, etc.). 

In contrast to some other modal logics, temporal logics do not have very 
close Description Logic relatives. However, they are mentioned here because 
they are used to “temporalize” Description Logics; for a survey on temporal 
Description Logics, see [Artale and Franconi, 2001] and Chapter 6. When 
speaking of the “temporalization” of a logic, e.g., ACC , one usually refers 
to a logic with two-dimensional interpretations. One dimension refers to the 
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flow of time, and each state in this flow of time comprises an interpretation 
of the underlying logic, e.g., an ACC interpretation. Obviously, the logic 
obtained depends on the temporal logic chosen for the temporal dimension 
and on the underlying (description) logic. Moreover, one has the choice of 
requiring that the interpretation domain of each time point is the same for 
all states (“constant domain assumption”) or that it is a subset of the do- 
mains of the interpretations underlying future states. Examples of temporal- 
ized Description Logics can be found in [Wolter and Zakharyaschev, 1999d; 
Sturm and Wolter, 2002; Artale et al., 2001; Schild, 1993; Lutz et al., 2001b]. 
An alternative to this temporalization is to extend a Description Logic with 
a temporal concrete domain [Baader and Hanschke, 1991a]. This yields a 
“two-sorted” interpretation domain, consisting of abstract individuals on 
the one hand and time points or intervals on the other hand. Abstract indi- 
viduals are then related to the temporal structure using features (functional 
roles) and the standard concrete domain constructs. An example of such a 
logic is described by Lutz [2001a]. 

Hybrid logics extend standard modal logics with the ability to refer 
to single states (individuals in the interpretation domain) using so-called 
nominals (see, e.g., [Blackburn and Seligman, 1995; Areces et al, 2000; 
Areces, 2000] for hybrid logics related to Description Logics). Nominals are 
simply special propositional variables which hold in exactly one state. Hy- 
brid logics enjoy a variety of “nice” properties whose description goes be- 
yond the scope of this article; for a summary, see [Areces, 2000]. In Descrip- 
tion Logics, there are three standard ways to refer to individuals: (1) we 
can use ABox individuals in ABoxes, (2) we can use the “one-of” concept 
constructor {oi,...,o*,} which can be applied to individual names o* and 
which is present in only a few Description Logics (e.g., in the Description 
Logic described in [Bresciani et al, 1995]), and (3) we can use nominals in 
a similar way as in hybrid logics (e.g., [De Giacomo, 1995; Tobies, 2000; 
Horrocks and Sattler, 2001]), namely as special atomic concepts that are 
interpreted as singleton sets. For most Description Logics, there is a direct 
mapping between nominals and the “one-of” constructor and back: let o* 
stand for individual names and, at the same time, nominals. Then we can 
extend the translation / mentioned above to the “one-of” constructor as 
follows - provided that we make the unique name assumption (see Chap- 
ter 2) either for both the individual names and the nominals or for neither 
of them: 


/({oi, . . . ,o fc }) = f({oi} U • • • U {o fc }) = oi V • • • V o fc . 
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ABox individuals can be viewed as a restricted form of nominals, and each 
ABox in a Description Logic C can be translated into a single concept of 
(the extension of) C with conjunction, existential restriction, and “one-of”: 
first, translate each assertion of the form 

C(a) into {a} n C and 
R(a, b) into {a} n 3i?.{6}. 

Next, for Cj , . . . , C m the resulting concepts of this translation and U a role 

name not occurring in any C), define C = I I 3U.C t . Then each model 

l<i<m 

of C is a model of the original ABox - provided, again, that the unique 
name assumption holds either for both individual names and nominals or for 
neither. Vice versa, each model of the original ABox can easily be extended 
to a model of C. 

So far, we have only mentioned the weakest way in which nominals occur 
in hybrid logics. The next stronger form is formulae of the form (p@Oi which 
describes, intuitively, that <p holds in the state Oj. For U a universal role and 
C v the translation of ip, this formula corresponds to the concept 3U.(oi n 
Cy). Finally, we only point out that there are even more expressive ways 
of talking about nominals in hybrid logics using, for example, variables for 
nominals and quantification over them. 

So much for the relation between certain modal logics and certain De- 
scription Logics. In the remainder of this section, the relationship between 
standard DL constructors and their counterpart in modal logics is discussed. 

Number restrictions: In modal logics, the equivalent to qualified number 
restrictions ^ nR.C and ^ nR.C [Hollunder and Baader, 1991b] is known 
as graded modalities [Fine, 1972; Van der Hoek and de Rijke, 1995], whereas 
no equivalent to the standard, weaker form of number restrictions, ^ n R 
and ^ nR , has been considered explicitly. 

Number restrictions can be said to play a central role in Description Log- 
ics: they are present in almost all knowledge representation systems based on 
Description Logics, several variants have been investigated with respect to 
their computational complexity (e.g., see [Tobies, 1999c] for qualified num- 
ber restrictions, [Baader and Sattler, 1999] for symbolic number restrictions 
and number restrictions on complex roles), and it was proved by De Gia- 
como and Lenzerini [1994a] that reasoning with respect to (possibly cyclic) 
TBoxes for the DL equivalent to converse- pdl extended with qualified num- 
ber restrictions (on atomic and inverse atomic roles) is ExpTiME-complete. 

In contrast, they play a minor role in modal and dynamic logics. A more 
prominent role in dynamic logics is played by deterministic programs, i.e. , 
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programs that are to be interpreted as functional relations (see Chapter 2). 
Ben-Ari et al. [1982] and Parikh [1981] show that validity (and hence satisfi- 
ability) of dpdl (i.e. , the logic that is obtained from pdl by restricting pro- 
grams to be deterministic) is ExpTlME-complete. Moreover, Parikh [1981] 
has shown that pdl formulae can be linearly translated into dpdl formulae, 
and this translation was used by De Giacomo and Lenzerini [1994a] to code 
qualified number restrictions into DPDL formulae. As a consequence, we have 
that satisfiability and subsumption with respect to (possibly cyclic) TBoxes 
in ACC extended with regular expressions over roles and qualified number 
restrictions is in ExpTime. 


Transitivity: In modal logics and Description Logics, transitivity comes in 
(at least) two different shapes, as transitive roles (or frames whose accessibil- 
ity relation is transitive, as in K4 m ) and as the transitive closure operator 
on roles (or the Kleene star operator on programs in pdl). Interestingly, 
these two sorts of transitivity differ in their complexity. 

Fischer and Ladner [1979] prove that satisfiability in pdl is ExpTime- 
complete. However, the only operator on programs (or roles) used in the 
hardness proof is the transitive closure operator. Translated to Description 
Logics, this yields ExpTlME-completeness of satisfiability in ACC extended 
with the transitive closure operator on roles. 

In contrast, K4 m is known to be of the same complexity as K m 
(or A£C), namely PSPACE-complete [Halpern and Moses, 1992], while 
providing transitivity: K4 m is obtained from K m by restricting Kripke 
structures to those where the accessibility relations are transitive. Trans- 
lated into Description Logics, this means that concept satisfiability in 
ACC extended with transitive roles (i.e., the ability to say that cer- 
tain roles are interpreted as transitive relations) is in PSpace [Satt- 
ler, 1996]. An extension of this Description Logic with role hierar- 
chies was implemented in the Description Logic system FaCT [Horrocks, 
1998a]. Although pure concept satisfiability of this extension is ExpTime- 
hard, its highly optimized implementation behaves quite well [Horrocks, 
1998b]. 


Inverse roles: Without the converse operator on programs/time (or the 
inverse operator on roles), binary relations are restricted to be used asym- 
metrically: for example, one is restricted to modeling either “into the future” 
or “into the past”, or one must decide whether to use a role “has-child” or 
“is-child-of” , but may not use both and relate them in the proper way. Hence 
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in both modal and Description Logics, the converse/inverse operator plays 
an important role since it overcomes this asymmetry, and a variety of logics 
allowing this operator have been investigated [Streett, 1982; Vardi, 1985; 
De Giacomo and Massacci, 1996; Calvanese, 1996a; De Giacomo, 1996; 
Horrocks et al, 1999]. 


4-2.3 Guarded fragments 

Andreka et al. [1996] introduce guarded fragments as natural generalizations 
of modal logics to relations of arbitrary arity. Their definition and investi- 
gation was motivated by the question why modal logics have such “nice” 
properties, e.g., finite axiomatiz ability, Craig interpolation, and decidabil- 
ity. Guarded fragments are obtained from first-order logic by allowing the 
use of quantified variables only if these variables are guarded by appropri- 
ate atoms 11 before they are used in the body of a formula. More precisely, 
quantifiers are restricted to appear only in the form 

3y(P(x, y) A $(y)) or Vy(P(x, y) D <&(y)) (First Guarded Fragment) 

3y(P(x, y) A $(x, y)) or Vy(P(x,y) D <f>(x,y)) (Guarded Fragment) 

for atoms P, vectors of variables x and y, and (first) guarded fragment 
formulae $ with free variables in y and x (resp. in y). The loosely guarded 
fragment further allows a restricted form of conjunction as guards. 

Obviously, the translation (3y.R(x,y) A ip(y))(x) of the K formula Oip 
(or of the ACC concept 3 R.C<p) is a formula in the first guarded fragment 
since the quantified variable y is “guarded” by R. A more complex guarded 
fragment formula is 

3z±, Z2-(parents(a;, zi, zf) A (married^, Z 2 ) A (Vy.parents(y, z\, zf) A rich(y)))) 

in one free variable x, a guard atom parents, and describing all those persons 
that have married parents and whose siblings (including oneself) are rich. 

All guarded fragments have been shown to be decidable [Andreka et al., 
1996]. Gradel [1999] proves that satisfiability of the guarded fragment is 
in ExpTime - provided that the arity of the predicates is bounded - and 
2ExpTiME-complete for unbounded signatures. Interestingly, the guarded 
fragment was shown to remain in 2ExpTime when extended with fixpoints 
[Gradel and Walukiewicz, 1999]. These “nice” properties together with their 
close relationship to modal logics and Description Logics suggest that they 
are a good starting point for the development of a Description Logic with 
n - ary predicates [Gradel, 1998]: in [Lutz et al., 1999], a restriction of the 


11 Atoms are formulae P(x i, . . . , x^) where P is a k - ary predicate symbol and Xj are variables. 
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guarded fragment was proven to be PSPACE-complete, where the restric- 
tion concerns the way in which variables are used in guard atoms. Roughly 
speaking, each predicate A comes with a two- fold arity (i, j) and, when A 
is used as a guard, either all first i variables are quantified and none of the 
last j are or, symmetrically, all last j variables are quantified and none of 
the first i are. Hence one might think of the predicates as having two-fold 
“groupings”. A similar logic, the so-called action-guarded fragment AGF, is 
proposed in [Gonqalves and Gradel, 2000]: it comes with a similar grouping 
of variables in predicates (which is, when extended with “inverse actions”, 
the same as the grouping in [Lutz et al, 1999]) and, additionally, it divides 
predicates into those allowed as guards and those allowed in the body of 
formulae. From a DL perspective, this should not be too severe a restric- 
tion since it parallels the distinction between role and concept names. In- 
terestingly, the extension of AGF with counting quantifiers (the first-order 
counterpart of number restrictions), inverse actions, and fixpoints yields 
an ExpTime logic - provided that the arity of the predicates is bounded 
and that numbers in counting quantifiers are coded unarily [Gonqalves and 
Gradel, 2000]. This result is even more interesting when we note that the 
guarded fragment, when extended with number restrictions, functional re- 
strictions, or transitivity (i.e., statements saying that certain binary re- 
lations are to be interpreted as transitive relations) becomes undecidable 
[Gradel, 1999]. 

To the best of our knowledge, the only other n-ary Description Logics with 
sound and complete inference algorithms are VCR, [Calvanese et al, 1998a] 
and VCTZf, [Calvanese et al, 1999c], which seem to be orthogonal to the 
guarded fragment. An exact description of the relationship between VCR 
(resp. VCR,fj,) and the guarded fragment (resp. its extension with fixpoints) 
is missing so far. 


4.3 Database models 

In this section we will describe the relationship between Description Logics 
and data models used in databases. We will consider both traditional data 
models used in the conceptual modeling of an application domain, such as 
semantic and object-oriented data models, and more recently introduced 
formalisms for representing semistructured data and data on the web. We 
will concentrate on the relationship between the formalisms and refer to 
Chapter 16 for a more detailed discussion on the use of Description Logics 
in data management [Borgida, 1995]. 
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Fig. 4.8. An Entity-Relationship schema. 


4-3.1 Semantic data models 

Semantic data models were introduced primarily as formalisms for database 
schema design [Abrial, 1974; Chen, 1976], and are currently adopted in most 
of the database and information system design methodologies and Computer 
Aided Software Engineering (CASE) tools [Hull and King, 1987; Batini et al, 
1992], In semantic data models, classes provide an explicit representation 
of objects with their attributes and their relationships to other objects, 
and subtype-supertype relationships are used to specify the inheritance of 
properties. Here, we concentrate on the Entity-Relationship (ER) model 
[Chen, 1976; Teorey, 1989; Batini et al., 1992; Thalheim, 1993], which is one 
of the most widespread semantic data models. However, the considerations 
we make hold also for other formalisms for conceptual modeling, such as 
UML class diagrams [Rumbaugh et al., 1998; Jacobson et al., 1998]. 

4-3. 1.1 Formalization 

The basic elements of the ER model are entities, relationships, and at- 
tributes, which are used to model the domain of interest by means of an 
ER schema. 

Figure 4.8 shows a simple ER schema representing the registration of cus- 
tomers for (telephone) services provided by departments (e.g., of a telephone 
company). The schema is drawn using the standard graphical ER notation, 
in which entities are represented as boxes, and relationships as diamonds. An 
attribute is shown as a circle attached to the entity for which it is defined. An 
entity type (or simply entity) denotes a set of objects, called its instances , 
with common properties. Elementary properties are modeled through at- 
tributes, whose values belong to one of several predefined domains, such as 
Integer, String, Boolean, etc. Relationships between instances of different en- 
tities are modeled through relationship types (or simply relationships). A 
relationship denotes a set of tuples, each one representing an association 
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among a combination of instances of the entities that participate in the re- 
lationship. The participation of an entity in a relationship is called an ER 
role and has a unique name. It is depicted by connecting the relationship to 
the participating entity. The number of ER roles for a relationship is called 
its arity. 

Cardinality constraints can be attached to an ER role in order to restrict 
the minimum or maximum number of times an instance of an entity may 
participate via that ER role in instances of the relationship [Abrial, 1974; 
Grant and Minker, 1984; Lenzerini and Nobili, 1990; Ferg, 1991; Ye et al., 
1994; Thalheim, 1992; Calvanese and Lenzerini, 1994b]. Minimal and max- 
imal cardinality constraints can be arbitrary non-negative integers. How- 
ever, typical values for minimal cardinality constraints are 0, denoting no 
constraint, and 1, denoting mandatory participation of the entity in the 
relationship; typical values for maximal cardinality constraints are 1, de- 
noting functionality, and oo, denoting no constraint. In Figure 4.8, car- 
dinality constraints are used to impose that each customer must be reg- 
istered for at least one service. Also, each service is provided by exactly 
one department, which in turn may not provide more than 20 different 
services. 

To represent inclusions between the sets of instances of two entities or 
two relationships, so called IS -A relations are used. An IS-A relation states 
the inheritance of properties from a more general entity (resp. relationship) 
to a more specific one. A generalization is a set of IS-A relations which 
share the more general entity (resp. relationship). Multiple generalizations 
can be combined in a generalization hierarchy. A generalization can be mu- 
tually exclusive , meaning that all the specific entities (resp. relationships) 
are mutually disjoint, or complete, meaning that the union of the more spe- 
cific entities (resp. relationships) completely covers the more general entity 
(resp. relationship). In Figure 4.8, a mutually exclusive and complete gen- 
eralization is used to represent the fact that customers are partitioned into 
private and business customers. 

Additionally, keys are used to represent the fact that an instance of an 
entity is uniquely identified by a certain set of attributes, or that an instance 
of a relationship is uniquely identified by a set of instances of the entities 
participating in the relationship. 

Although we do not provide a formal definition here, the semantics of an 
ER schema can be given by specifying which database states are consistent 
with the information structure represented by the schema; for details see 
e.g., [Calvanese et al., 1999e]. 
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Traditionally, the ER model has been used in the design phase of com- 
mercial applications, and modern CASE tools usually provide sophisticated 
schema editing facilities and automatic generation of code for the inter- 
action with the database management system. However, these tools do not 
provide any support beyond the graphical user interface, for dealing with the 
complexity of schemas. In particular, the designer is responsible for check- 
ing schemas for important properties such as consistency and redundancy. 
This may be a complex and time-consuming task if performed by hand. By 
translating an ER schema into a DL knowledge base in such a way that the 
verification of schema properties corresponds to traditional DL reasoning 
tasks, the reasoning facilities of a DL system can be profitably exploited to 
support conceptual database design. 

4-3. 1.2 Correspondence with Description Logics 
Both in Description Logics and in the ER model, the domain of interest 
is modeled through classes and relationships, and various proposals have 
been made for establishing a correspondence between the two formalisms. 
Bergamaschi and Sartori [1992] provide a translation of ER schemas into 
acyclic ALM knowledge bases. However, due to the limited expressiveness of 
the target language, several features of the ER model and desired reasoning 
tasks could not fully be captured by the proposed translation. Indeed, when 
relating the ER model to Description Logics, one has to take into account 
the following aspects: 

1. The ER model allows relations of arbitrary arity, while in traditional Descrip- 
tion Logics only unary and binary relations are considered. 

2. The assumption of acyclicity is unrealistic in an ER schema, while it is common 
in DL knowledge bases. 

3. Database states are considered to be finite structures, while no assumption 
on finiteness is usually made on the interpretation domain of a DL knowledge 
base. 

Before discussing these issues in more detail, we show in Figure 4.9 part of 
the ACLtMZ knowledge base corresponding to the ER schema in Figure 4.8, 
derived according to the translation proposed by Calvanese et al. [1994; 
1999e]. We have omitted the part corresponding to the translation of most 
attributes, showing as an example only the translation of the attribute name 
of the entity Customer. 

Due to point 1, when translating ER schemas into knowledge bases of a 
traditional Description Logic, it becomes necessary to reify relationships, 
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Registration C 

Supply C 

Customer C 
Location C 
Service □ 

Department C 
Customer C 
BusinessCustomer C 
PrivateCustomer C 
Customer C 


VcustRegistration. Customer n = 1 custRegistration n 
VIocRegistration. Location n = 1 locRegistration n 
VservRegistration. Service n = 1 servRegistration 

VservSupply. Service n = 1 servSupply n 
VcomSupply. Customer n = 1 comSupply 

VcustRegistration - . Registration n ^ 1 custRegistration 
VIocRegistration - . Registration 
VservRegistration - . Registration n 
VservSupply - . Supply n = 1 servSupply - 

VcomSupply - .Supply n ^ 20 comSupply - 
BusinessCustomer U PrivateCustomer 
Customer 

Customer n ->BusinessCustomer 
Vname. String n = 1 name 


Fig. 4.9. Part of the knowledge base corresponding to the Entity-Relationship 
schema in Figure 4.8. 


i.e., to translate each relationship into a concept whose instances represent 
the tuples of the relationship. Each entity is also translated into a concept, 
while each ER role is translated into a Description Logic role. Then, using 
functional roles, one can enforce that each instance of the atomic concept C 
corresponding to a relationship R represents a tuple of R, i.e., for each role 
representing an ER role of R, the instance of C is connected to exactly one 
instance of the entity associated to the ER role. 

There is, however, one condition, which is implicit in the semantics of the 
ER model, but which does not necessarily hold once relationships are reified, 
and which can also not be enforced in Description Logics on the models of a 
knowledge base: the condition is that the extension of a relationship R does 
not contain some tuple twice. After reification this corresponds to the fact 
that there are no two instances of the concept corresponding to R that are 
connected through all roles of R exactly to the same instances of the entities 
associated to the roles. However, it can be shown that, when reasoning on a 
knowledge base corresponding to an ER schema, nothing is lost by ignoring 
this condition. Indeed, given an arbitrary model of such a knowledge base, 
one can always find a model in which the condition holds, and thus one 
that corresponds directly to a legal database state [Calvanese et al., 1994; 
De Giacomo, 1995; Calvanese et al, 1999e]. 

Cardinality constraints are translated using number restrictions on the 
inverse of the roles connecting relationships to entities. To avoid the need 
for qualified number restrictions, in the translation in Figure 4.9 we have 
disambiguated the roles by appending to their name the name of the re- 
lationship they belong to. An alternative would be to allow the same role 
to appear in several places, and use qualified number restrictions instead 
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of unqualified ones. While considerably complicating the language, this 
makes it possible to translate also IS-A relations between relationships, 
which cannot be captured using the translation proposed by Calvanese 
et al. [l999e]. Also more general forms of cardinality constraints have been 
proposed for the ER model [Thalheim, 1992], allowing one, e.g., to limit 
the number of locations a customer may be registered for, independently of 
the service. To the best of our knowledge, such types of cardinality cons- 
traints cannot be captured in Description Logics in general. Borgida and 
Weddell [1997] have studied reasoning in Description Logics in the presence 
of functional dependencies that are more general than unary ones, and which 
allow one to represent keys of relations. Decidability of reasoning in a very 
expressive Description Logic augmented with non-unary key constraints has 
been shown by Calvanese et al. [2000b], and Calvanese et al. [2001a] have 
shown that also general functional dependencies can be added without losing 
ExpTlME-conrpleteness. 

IS-A relations are simply translated using concept inclusion assertions. 
Generalization hierarchies additionally require negation, if they are mutually 
disjoint, and union, if they are complete. 

With respect to point 2, we observe that the translation of an ER schema 
containing cycles obviously gives rise to a cyclic DL knowledge base. How- 
ever, due to the necessity of properly relating a relationship via an ER role to 
an entity, even when translating an acyclic ER schema, the resulting knowl- 
edge base contains cycles. On the other hand, it is sufficient to use inclusion 
assertions rather than equivalence, since the former naturally correspond to 
the semantics of ER schemas. 

With respect to point 3, we observe that one cannot simply ignore it and 
adopt algorithms that reason with respect to arbitary models. Indeed, the 
ER model itself does not have the finite model property [Cosmadakis et al., 
1990; Calvanese and Lenzerini, 1994b], which states that, if a knowledge 
base (resp. schema) has an arbitrary, possibly infinite model (resp. database 
state), then it also has a finite one (see also Chapter 5 for more details). A 
further confirmation comes from the fact that, for correctly capturing ER 
schemas in Description Logics, possibly cyclic knowledge bases expressed in 
a Description Logic including functional restrictions and inverse roles are 
required, and such knowledge bases do not have the finite model property 
[Calvanese et al., 1994; 1999e]. Therefore one must resort to techniques for fi- 
nite model reasoning. Calvanese et al. [1994] show that reasoning w.r.t. finite 
models in AC1AMT knowledge bases containing only inclusion assertions is 
ExpTiME-complete, and Calvanese [1996a] presents a 2ExpTime algorithm 
for reasoning in ACCQI knowledge bases with general inclusion assertions. 
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4- 3. 1.3 Applications of the correspondence 
The study of the correspondence between Description Logics and semantic 
data models has led to significant advantages in both fields. On the one hand, 
the richness of constructs that is typical of Description Logics makes it pos- 
sible to add them to semantic data models and take them fully into account 
when reasoning on a schema [Calvanese et al., 1998g] . Notable examples are: 

• the ability to specify not only IS-A and generalization hierarchies, but also arbi- 
trary Boolean combinations of entities or relationships, which can correspond to 
forms of negative and incomplete knowledge [Di Battista and Lenzerini, 1993]; 

• the ability to refine properties along an IS-A hierarchy, such as restricting the 
numeric range for cardinality constraints, or refining the participation in rela- 
tionships using universal quantification over roles; 

• the ability to define classes by means of equality assertions, and not only to state 
necessary properties for them. 

The correspondence between semantic data models and Description Logics 
has been recently exploited to add such advanced capabilities to CASE tools. 
A notable example is the l«COM tool [Franconi and Ng, 2000] for conceptual 
modeling, which combines a user-friendly graphical interface with the ability 
to automatically infer properties of a schema (e.g., inconsistency of a class, 
or implicit IS-A relations) by invoking the FaCT Description Logic reasoner 
[Horrocks, 1998a; 1999]. 

On the other hand, the basic ideas behind the translation of semantic 
data models into Description Logics, namely reification and the fact that 
one can restrict the attention to models in which distinct instances of a rei- 
fied relation correspond to distinct tuples, have led to the development of 
Description Logics in which relations of arbitrary arity are first class citizens 
[De Giacomo and Lenzerini, 1994c; Calvanese et al ., 1997; 1998a]. Using such 
Description Logics, the translation of an ER schema is immediate, since now 
relationships of arbitrary arity also have their direct counterpart. For exam- 
ple, using VCR [Calvanese et al, 1998a], the part of the schema in Figure 4.8 
relative to the ternary relation Registration can be translated as follows: 

Registration C ($1: Customer) n ($2: Location) n ($3: Service) 
Customer C 3[$1] Registration. 

We refer to Chapter 16, Subsection 16.2.2 for the details of the translation. 

Description Logics could also be considered as expressive variants of 
semantic data models with incorporated reasoning facilities. This is of 
particular importance in the context of information integration, where a 
high expressiveness is required to capture in the best possible way the 
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complex relationships that hold between data in different information 
sources [Levy et al, 1995; Calvanese et al, 1998d; 1998e]. 


4-3.2 Object-oriented data models 

Object-oriented data models have been proposed recently with the goal of 
devising database formalisms that could be integrated with object-oriented 
programming systems [Abiteboul and Kanellakis, 1989; Kim, 1990; Cattell 
and Barry, 1997; Rumbaugh et al., 1998]. Object-oriented data models rely 
on the notion of object identifier at the extensional level (as opposed to 
traditional data models which are value-oriented) and on the notion of class 
at the intensional level. The structure of the classes is specified by means 
of typing and inheritance. Since we aim at discussing the relationship with 
Description Logics, which are well-suited to describe structural rather than 
dynamic properties, we restrict our attention to the structural component of 
object-oriented models. Hence we do not consider all those aspects that are 
related to the specification of the behavior and evolution of objects, which 
nevertheless constitute an important part of these data models. Although 
in our discussion we do not refer to any specific formalism, the model we 
use is inspired by the one presented by Abiteboul and Kanellakis [1989], 
and embodies the basic features of the static part of the ODMG standard 
[Cattell and Barry, 1997]. 


4-3.2. 1 Formalization 

An object-oriented schema is a finite set of class declarations, which im- 
pose constraints on the instances of the classes that are used to model the 
application domain. A class declaration for a class C has the form 

class C is-a C\, ... ,Ck type-is T, 

where the is-a part, which is optional, specifies inclusions between the sets of 
instances of the classes involved, while the type-is part specifies through the 
type expression T the structure assigned to the objects that are instances 
of the class. We consider union , set , and record types, built according to the 
following syntax, where the letter A is used to denote attributes: 

T — >C | 

union 7) , . . . , T). end | 
set-of T | 

record A\: Xj, . . . , A&: T \ end . 

Figure 4.10 shows part of an object-oriented schema modeling the same 
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class Registration type-is 
record 

cust: Customer, 
regis: set-of record 

serv: Service 
loc: Location 
end 

end 

class Service type-is 
record 

code: Integer, 
suppliedBy: Department 
end 


class Customer type-is 

union BusinessCustomer, PrivateCustomer 
end 

class PrivateCustomer is-a Customer type-is 
record 

SSN: String 
end 


Fig. 4.10. Part of an object-oriented schema. 


reality as the Entity-Relationship schema of Figure 4.8. Notice that now 
registrations are represented as a class and grouped according to the cus- 
tomer, since all registrations related to one customer are collected in the 
set-valued attribute regis. 

The meaning of an object-oriented schema is given by specifying the char- 
acteristics of a database state for the schema. The definition of a database 
state makes use of the notions of object identifier and value. Starting from a 
finite set O ^ of object identifiers, the set of complex values over O ^ is built 
inductively by grouping values into finite sets and records. A database state 
J for a schema is constituted by the set of object identifiers, a mapping 
assigning to each class a subset of . and a mapping p ^ assigning to each 
object in O ^ a value over . 

Notice that, although the set of values that can be constructed from a 
set O ^ of object identifiers is infinite, for a database state one only needs 
to consider the finite subset Vj of values assigned by p ^ to the elements 
of , including the values that are not explicitly associated with object 
identifiers, but are used to form other values. 

The interpretation of type expressions in a database state J is defined 
through an interpretation function that assigns to each type expression 
T a set T ^ of values in Vj as follows: 


• if T is a class C, then T J = n J (C); 

• if T is a union type union Tj , . . . , T*, end , then T J = Tfi U • • • U Tff ; 

• it T is a record type (resp. set type), then T J is the set of record values (resp. set 
values) compatible with the structure of T. For records we are using an open 
semantics, meaning that the records that are instances of a record type may 
have more components than those explicitly specified in the type [Abiteboul and 
Kanellakis, 1989]. 
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A database state J for an object-oriented schema S is said to be legal 
(with respect to S) if for each declaration 

class C is-a C\, ... ,C n type- is T 

in S, we have (1) C Cf for each i G { 1, . . . , n}, and (2) (C^) C . 

Therefore, for a legal database state, the type expressions that are present 
in the schema determine the (finite) set of values that must be considered. 
The construction of such values is limited by the depth of type expressions. 

4- 3. 2. 2 Correspondence with Description Logics 
When establishing a correspondence between an object-oriented model such 
as the one presented above, and Description Logics, one must take into ac- 
count that the interpretation domain for a DL knowledge base consists of 
atomic objects, whereas each object of an object-oriented schema is assigned 
a possibly structured value. Therefore one needs to explicitly represent in 
Description Logics the type structure of classes [Calvanese et al, 1994; 1999e; 
Artale et al., 1996a]. We describe now the translation proposed by Calvanese 
et al. [1994; 1999e], that overcomes this difficulty by introducing into the 
DL knowledge base concepts and roles with a specific meaning: the concepts 
AbstractClass, RecType, and SetType are used to denote instances of classes, 
record values, and set values, respectively. The associations between classes 
and types induced by the class declarations, as well as the basic characteris- 
tics of types, are modeled by means of specific roles: the functional role value 
models the association between classes and types, and the role member is 
used for specifying the type of the elements of a set. Moreover, the concepts 
representing types are assumed to be mutually disjoint, and disjoint from 
the concepts representing classes. These constraints are expressed by the 
following inclusion assertions, which are always part of the knowledge base 
that is obtained from an object-oriented schema: 

AbstractClass C = lvalue 
RecType C Vvalue.T 
SetType C Vvalue.T n -iRecType. 

The translation from object-oriented schemas to Description Logic knowl- 
edge bases is defined through a mapping T, which maps each type expression 
to a concept expression as follows: 

• Each class C is mapped to an atomic concept r(C). 

• Each type expression union Tj, . . . , Tk end is mapped to r(Ti) U • • • U r(Tfc). 

• Each type expression set-of T is mapped to SetType fl Vmember.r(T). 
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Customer C AbstractClass n Vvalue.(BusinessCustomer U PrivateCustomer) 

PrivateCustomer C AbstractClass n Customer n Vvalue.(RecType n = 1 SSN n VSSN. String) 

Service C AbstractClass n 

Vvalue.(RecType n = 1 code n Vcode. Integer n 

= 1 suppliedBy n VsuppliedBy. Department) 

Customer □ AbstractClass n 

Vvalue.(RecType n = 1 cust n Vcust. Customer n 
= 1 regis n Vregis.(SetType n 

Vmember.(RecType n 

= 1 serv n Vserv. Service n 
= 1 loc n Vloc. Location))) 

Fig. 4.11. The specific part of the knowledge base corresponding to the object- 
oriented schema in Figure 4.10. 

• Each attribute A is mapped to an atomic role r(A), and each type expression 
record Ap Ti, . . . , A^: Tf. end is mapped to 

RecType n vr(^l 1 ).r(T 1 ) n = l r(vl 1 ) n • • • n 
vr(A fe ).r(T fe ) n = ir(A fc ). 

Then, the knowledge base T(«S) corresponding to an object-oriented schema 
S is obtained by taking for each class declaration 

class C is-a C\, ... ,C n type-is T 

an inclusion assertion 

T(C) c AbstractClass n r(C'i) n • • • n T(C n ) n Vvalue.r(T). 

We show in Figure 4.11 the knowledge base resulting from the translation 
of the fragment of object-oriented schema shown in Figure 4.10. 

Analogously to the ER model, it is sufficient to use inclusion assertions 
instead of equivalence assertions to capture the semantics of object-oriented 
schemas. A translation to an acyclic knowledge base is possible under the 
assumption that no class in the schema refers to itself, either directly in its 
type or indirectly via the class declarations 12 [Artale et al, 1996a]. However, 
since this assumption represents a rather strong limitation in expressiveness, 
cycles are typically present in object-oriented schemas, and in this case the 
resulting DL knowledge base will contain cyclic assertions. No inverse roles 
are needed for the translation, since in object-oriented models the inverse of 
an attribute is rarely considered. Furthermore, the use of number restrictions 
is limited to functionality, since all attributes are implicitly functional. 

To establish the correctness of the transformation, and thus ensure that 
the reasoning tasks on an object-oriented schema can be reduced to reason- 
ing tasks on its translation into Description Logics, we would like to establish 

12 Note that cyclic references cannot appear directly in a type, which is constructed inductively, 
but only through the class declarations. 



Relationships with other Formalisms 


185 


a one-to-one correspondence between database states legal for the schema 
and models of the knowledge base resulting from the translation. However, 
as for the ER model, the knowledge base may have models that do not cor- 
respond directly to legal database states. In this case, this is due to the fact 
that, while values have a treelike structure, the corresponding individuals in 
a model of the Description Logic knowledge base may be part of cyclic sub- 
structures. One way of ruling out such cyclic substructures would be to adopt 
a specific constructor that allows one to impose well-foundedness [Calvanese 
et al, 1995], or even exploit general fixed points on concepts [Schild, 1994; 
De Giacomo and Lenzerini, 1994a; 1997; Calvanese et al, 1999c]. However, 
it turns out that, in this case, it is not necessary to explicitly enforce such 
a condition. Indeed, due to the finite depth of nesting of types in a schema, 
it can be shown that each model of the translation of the schema can be 
unfolded into one that directly corresponds to a legal database state (more 
details are provided by Calvanese et al. [l999e]). 

4- 3. 2. 3 Applications of the correspondence 

Similarly to the ER model, the existence of property-preserving transfor- 
mations from object-oriented schemas into DL knowledge bases makes it 
possible to exploit the reasoning capabilities of a DL system for checking rel- 
evant schema properties, such as consistency and redundancy [Bergamaschi 
and Nebel, 1994; Artale et al, 1996a; Calvanese et al, 1998g]. Additionally, 
several extensions of the object-oriented formalism that are useful for the 
purpose of conceptual modeling can be considered: 

• Not only IS-A, but also disjointness, and, more generally, Boolean combinations 
of classes can be used. 

• Class definitions can be used to specify not only necessary but also necessary and 
sufficient properties for an object to be an instance of a class [Bergamaschi and 
Nebel, 1994], 

• Cardinality constraints and not only implicit functionality can be imposed on 
attributes. Having attributes with multiple values could in some cases be a useful 
alternative to set-valued attributes. 

• By admitting also the use of inverse roles in the language, one gains the ability to 
impose constraints using a relation in both directions, as is customary in semantic 
data models. The increase in expressiveness that one obtains this way has indeed 
been recognized as extremely important by the database community [Albano 
et al, 1991], and has been included in the recent ODMG standard [Cattell and 
Barry, 1997]. 

The basic characteristics of object-oriented data models have also been 
included in the structural part of the Unified Modeling Language (UML) 



186 


U. Sattler, D. Calvanese, and R. Molitor 


[Rumbaugh et al, 1998; Jacobson et al, 1998], which is becoming the stan- 
dard language for the analysis phase of software and information system 
development. Additionally, UML allows the definition of generic recursive 
data structures (both inductive and co-inductive) such as lists and trees, and 
their specialization to specific types. In order to capture also these aspects of 
UML in Description Logics and take them fully into account when reasoning 
over a schema, the Description Logic must provide the ability to represent 
and reason over data structures. In particular, to represent UML schemas, it 
is necesary to resort to very expressive Description Logics including number 
restrictions, inverse roles or n-ary relations, and fixpoint constructs on con- 
cepts [Calvanese et al., 1999c]. Also in this case, the reasoning services pro- 
vided by a DL system can be integrated into CASE tools and profitably ex- 
ploited to support the designer in the analysis phase [Franconi and Ng, 2000] . 


4-3.3 Semistructured data models and XML 

In recent application areas such as data integration, access to data on the 
web, and digital libraries, the structure of the data is usually not rigid, as in 
conventional databases, and thus it is difficult to describe it using traditional 
data models. Therefore, so called semistructured data models have been pro- 
posed, which are graph-based data models that provide flexible structuring 
mechanisms, and thus allow one to represent data that is neither raw nor 
strictly typed [Abiteboul et al., 2000; Abiteboul, 1997; Buneman et al., 1997; 
Mendelzon et al., 1997]. The Extensible Markup Language (XML) [Bray 
et al., 1998; Abiteboul et al., 2000], which has been introduced as a mech- 
anism for representing structured documents on the Web, can in fact also 
be considered a model for semistructured data. Indeed, XML is by now the 
most popular model for data on the Web, and there is a tremendous ef- 
fort related to XML and the associated standards, 13 both in the research 
community and in industry. 

Description Logics have traditionally been used to describe and organize 
data in a more flexible way than is done in databases, basically using graph- 
like structures. Hence it seems natural to adopt Description Logics and the 
associated reasoning services for representing and reasoning on semistruc- 
tured data and XML as well. In the following, we discuss the (rather few) 
proposals made in the literature. What these proposals have in common 
is the necessity to resort to fixpoints, either by adopting fixpoint seman- 
tics [Nebel, 1991; Baader, 1991], or by using reflexive-transitive closure or 


13 


http : //www . w3 . org/ 
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explicit fixpoint constructs [De Giacomo and Lenzerini, 1997] (see also Chap- 
ter 5). 

For the recent extensive work on the use of Description Logics to provide a 
semantically richer representation of data on the Web we refer to Chapter 14. 


4-3.3. 1 Relationship between semistructured data and Description Logics 
Michaeli et al. [1997] propose to extend a semistructured data model that 
is an abstraction of the OEM model [Abiteboul et al., 1997] with a layer of 
classes, representing objects with common properties. Class expressions cor- 
respond to DL concepts and the properties for the classes are specified by a 
set of classification rules, which provide sufficient conditions for class mem- 
bership and are interpreted under a least fixpoint semantics. By a reduction 
to reasoning in a Description Logic with fixpoint operators [De Giacomo and 
Lenzerini, 1997; Calvanese et al., 1999c], it is shown that determining class 
satisfiability and containment under a set of rules is ExpTiME-decidable 
(and in fact ExpTiME-complete) . 

In the following, we discuss in more detail the use of Description Logics 
to represent and reason on semistructured data, on the example of one typ- 
ical representative for semistructured data models. In semistructured data 
models, data is organized in the form of a graph, and information on both 
the values and the schema for the data is attached to the edges of the graph. 
In the formalism proposed by Buneman et al. [1997], the labels of edges in a 
schema are formulae of a complete first-order theory, and the conformance 
of a database to a schema is defined in terms of a special relation, called 
simulation. The notion of simulation is less rigid than the usual notion of 
satisfaction, and suitably reflects the need for dealing with less strict data 
structures. In order to capture in Description Logics the notion of simula- 
tion, it is necessary on the one hand to express the local conditions that 
a node must satisfy, and on the other hand to deal with the fact that the 
simulation relation is the greatest relation satisfying the local conditions. 
Since semistructured data schemas may contain cycles, the local conditions 
may depend on each other in a cyclic way. Therefore, while the local condi- 
tions can be encoded by means of suitable inclusion assertions in ACU, the 
maximality condition on the simulation relation can only be captured cor- 
rectly by resorting to a greatest fixpoint semantics [Calvanese et al., 1998c; 
1998b]. Then, using a Description Logic with fixpoint constructs, such as 
pACCQ [De Giacomo and Lenzerini, 1994b; 1997] (see also Chapter 5), a 
so-called characteristic concept for a semistructured data schema can be con- 
structed, which captures exactly the properties of the schema. Subsumption 
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<?xml version="l .0"?> 

<!DOCTYPE Customers SYSTEM "services . dtd"> 

<Customers> 

<Customer type="business"> 

<Name>FIAT</Name> 

<Field>manuf acturing</Field> 

<Registered service="522"> 

<Locat ion><City>Torino</ City> 

<Address>. . .</Address> 

</Location> 

<Location>. . . </Location> 

</Registered> 

<Registered service="612"> 

<Location>. . . </Location> 

</Registered> 

</Customer> 

<Customer type="private"> 

<Name>. . .</Name> 

<SSN> . . . </SSN> 

<Registered service="214"> 

<Location>. . . </Location> 

</Registered> 

</Customer> 

</Customers> 

Fig. 4.12. Two XML documents specifying respectively customers and services. 

between two schemas, which is the task of deciding whether every semistruc- 
tured database conforming to one schema also conforms to another schema 
[Buneman et al. , 1997], can be decided by checking subsumption between 
the characteristic concepts of the schemas [Calvanese et al, 1998c]. 

The correspondence with Description Logics can again be exploited to 
enrich semistructured data models, without losing the ability to check 
schema subsumption. Indeed, the requirement already raised by Buneman 
et al. [1997], to extend semistructured data models with several types of 
constraints, has been addressed by Calvanese et al. [1998b], who propose 
several types of constraints, such as existence and cardinality constraints, 
which are naturally derived from DL constructs. Reasoning in the presence 
of constraints is done by encoding also the constraints in the characteristic 
concept of a schema. Calvanese et al. deal also with the presence of incom- 
plete information in the theory describing the properties of edge labels, by 
proposing the use of a theory expressed in /. lACCQ , instead of a complete 
first order theory. 


<?xml version=" 1 . 0"?> 

< ! DOCTYPE Services SYSTEM "services . dtd"> 
<Services> 

department name=" standard-services "> 
<Service code="522"> 

<Name>call-back when busy</Name> 
<Cost>. . .</Cost> 

</Service> 

<Service code="214"> 

<Name>three-party call</Name> 
</Service> 

</Department> 

department name="business-services"> 
<Service code="612"> 

<Name>conf erence call</Name> 
</Service> 

</Department> 

</Services> 


4- 3. 3. 2 Relationship between XML and Description Logics 

XML [Bray et al., 1998] is a formalism for representing documents that 
are structured by means of nested tags. Recently, XML has gained 
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<! — File: services. dtd — > 

<! ELEMENT Customers (Customer) + > 

<! ELEMENT Customer (Name, (Field ISSN), Registered+) > 
<! ELEMENT Registered (Location) + > 

<! ELEMENT Services (Department) + > 

<! ELEMENT Department (Service)* > 

<! ELEMENT Service (Name, Cost?, ...) > 

<! ELEMENT Name #PCDATA > 


<!ATTLIST Customer type (business I private) "private"> 
<!ATTLIST Registered service IDREF #REQUIRED> 
<!ATTLIST Department name CDATA #REQUIRED> 
< ! ATTLIST Service code ID #REQUIRED> 


Fig. 4.13. Part of the Document Type Declaration S for the XML documents in 
Figure 4.12. 

popularity also as a formalism for representing (semistructured) data and 
exchanging it over the Web. Figure 4.12 shows two example XML docu- 
ments containing respectively data about customers and their registration 
to services provided by various departments (e.g., of a telephone company). 
A part of an XML document consisting of a start tag (e.g., <Customer>), the 
matching end tag (e.g., </Customer>), and everything in between is called 
an element. Elements can be arbitrarily nested, and can have associated 
attributes, specified by means of attribute-value pairs inside the start tag 
(e.g., type="business"). Intuitively, each XML document can be viewed as 
a finite ordered unranked tree, 14 where each element represents a node, and 
the children of an element are those elements directly contained in it. How 
XML documents are viewed as trees is defined, together with an API for 
accessing and manipulating such trees/XML-documents, by the Document 
Object Model, 15 which defines, besides element nodes, other types of nodes, 
such as attributes, comments, etc. 

In XML, it is possible to impose a structure on documents by means of 
a Document Type Declaration (DTD) [Bray et al, 1998]. A DTD consists 
of a set of declarations: For each element type used in the XML document, 
the DTD must contain a declaration that specifies, by means of a regular 
expression, how elements can be nested within elements of that type. The 
keyword #PCDATA is used to specify that the element content (i.e., the part 
enclosed by the tags) is free text without nested elements. For each attribute 

14 In an unranked tree each node can have an arbitrary finite number of child nodes. The tree is 

ordered since the order among children of the same node matters. 

15 http://www.w3.org/D0M/ 
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appearing in the XML document, the DTD must contain a declaration spec- 
ifying the name of the attribute, the type of the elements it is associated 
to, and additional properties (e.g., the type and whether the attribute is 
optional or mandatory). Figure 4.13 shows part of the DTD for the XML 
documents in Figure 4.12. We refer to [Bray et ai, 1998] for a precise defi- 
nition of the syntax and semantics of XML DTDs. 

We illustrate the method for encoding XML DTDs into DL knowledge 
bases proposed in [Calvanese et ai, 1999d]. For simplicity, we do not consider 
XML attributes, although they can easily be dealt with by introducing suit- 
able roles. Due to the presence of regular expressions, to encode DTDs into 
Description Logics, it is necessary to resort to a Description Logic equipped 
with constructs for building regular expressions over roles (see Chapter 5). 
Notice that the encoding of DTDs into DL knowledge bases must allow for 
representing unranked trees and at the same time for preserving the order 
of the children of a node. For example, the DTD in Figure 4.13 enforces that 
the content of a Customer element consists of a Name element, followed by 
(in DTDs, concatenation is denoted by either a Field or an SSN ele- 
ment ( alternative is denoted by “ | ” ) , followed by an arbitrary number (but 
at least one) of Registered elements ( transitive closure is denoted by “+”). 
To overcome these difficulties, Calvanese et al. [l999d] propose to represent 
XML documents (i.e. , ordered unranked trees) by means of binary trees, 
and provide an encoding of DTDs in Description Logics that exploits such 
a representation. Figure 4.14 shows the binary tree corresponding to one of 
the XML documents in Figure 4.12. 

Figure 4.15 shows part of the axioms encoding the DTD in Figure 4.13. 
The two roles f and r are used to encode binary trees, and such roles 
are globally functional (axiom (4.1)). Moreover, the well-founded construct 
(see Chapter 5) wf( f U r) is used to express that there can be no infinite 
chain of objects, each one connected to the next by means of f U r. Such a 
condition turns out to be necessary to correctly capture the fact that XML 
documents correspond to trees that are finite. For each element type E, the 
atomic concepts StartF? and End E represent respectively the start tags (4.2) 
and end tags (4.3) for E, and such tags are leaves of the tree (4.4). The 
remaining leaves of the tree are free text, represented by the atomic concept 
PCDATA (4.5). Using such concepts and roles, one can introduce for each 
element type E appearing in a DTD D an atomic concept Ed, and encode 
the regular expression specifying the structure of elements of type E in a 
suitable complex role, exploiting constructs for regular expressions over roles 
(including the id(-) construct). This is illustrated in Figure 4.15 for part of 
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Fig. 4.14. The binary tree corresponding to the XML document on the left-hand 
side of Figure 4.12. 


T = < If n <lrn ui/(f U r) (4.1) 

Start/; C Tag for each element type E (4.2) 

End E C Tag for each element type E (4.3) 

Tag C V(fl_lr)._L (4.4) 

PCDATA C V(f U r)._L n ^Tag (4.5) 


Customers = Elf.StartCustomers n 3(r o (id(3f. Customers) o r) + ).EndCustomers 
Customers = Elf.StartCustomers n El(r o id(Elf. Names) ° r 

o (jd(Elf. Fields) U id(Elf.SSNs)) o r 
o (jd(Elf.Registereds) o r) + ).EndCustomer 
Names = Elf. StartName n 3(r o id(3f. PCDATA) o r).EndName 


Fig. 4.15. Part of the encoding of the DTD S in Figure 4.13 into a DL knowledge 
base. 

the element types of the DTD in Figure 4.13. We refer to [Calvanese et al, 
1999d] for the precise definition of the encoding. 

The encoding of DTDs into Description Logics can be exploited to verify 
different kinds of properties on DTDs, namely inclusion, equivalence, and 
disjointness between the sets of documents conforming respectively to two 
DTDs. Such reasoning tasks come in different forms. For strong inclusion 
(resp. equivalence, disjointness) both the document structure and the actual 
tag names are of importance when comparing documents, while for structural 
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inclusion (resp. equivalence, disjointness) one abstracts away from the actual 
tag names, and considers only the document structure [Wood, 1995]. Para- 
metric inclusion (resp. equivalence, disjointness) generalizes both notions, 
by considering an equivalence relation between tag names, and comparing 
documents modulo such an equivalence relation. By exploiting the encoding 
of DTDs into Description Logics presented above, all forms of inference on 
DTDs can be carried out in deterministic exponential time [Calvanese et al, 
1999d], 
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Abstract 

This chapter covers extensions of the basic Description Logics introduced 
in Chapter 2 by very expressive constructs that require advanced reasoning 
techniques. In particular, we study reasoning in description logics that in- 
clude general inclusion axioms, inverse roles, number restrictions, reflexive- 
transitive closure of roles, fixpoint constructs for recursive definitions, and 
relations of arbitrary arity. The chapter will also address reasoning w.r.t. 
knowledge bases including both a TBox and an ABox, and discuss more 
general ways to treat objects. Since the logics considered in the chapter lack 
the finite model property, finite model reasoning is of interest and will also 
be discussed. Finally, we mention several extensions to description logics 
that lead to undecidability, confirming that the expressive description logics 
considered in this chapter are close to the boundary between decidability 
and undecidability. 


5.1 Introduction 

Description Logics have been introduced with the goal of providing a for- 
mal reconstruction of frame systems and semantic networks. Initially, the 
research has concentrated on subsumption of concept expressions. However, 
for certain applications, it turns out that it is necessary to represent knowl- 
edge by means of inclusion axioms without limitation on cycles in the TBox. 
Therefore, recently there has been a strong interest in the problem of rea- 
soning over knowledge bases of a general form. See Chapters 2, 3, and 4 for 
more details. 
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When reasoning over general knowledge bases, it is not possible to 
gain tractability by limiting the expressive power of the description logic, 
because the power of arbitrary inclusion axioms in the TBox alone leads 
to high complexity in the inference mechanisms. Indeed, logical implication 
is ExpTiME-hard even for the very simple language AC (see Chapter 3). 
This has led to the investigation of very powerful languages for expressing 
concepts and roles, for which the property of interest is no longer tractabil- 
ity of reasoning, but rather decidability. Such logics, called here expressive 
description logics, have the following characteristics: 

1. The language used for building concepts and roles comprises all classical 
concept-forming constructs, plus several role-forming constructs such as inverse 
roles and reflexive-transitive closure. 

2. No restriction is posed on the axioms in the TBox. 

The goal of this chapter is to provide an overview of the results and tech- 
niques for reasoning in expressive description logics. The chapter is organized 
as follows. In Section 5.2, we outline the correspondence between expressive 
description logics and Propositional Dynamic Logics, which has given the 
basic tools to study reasoning in expressive description logics. In Section 5.3, 
we exploit automata-theoretic techniques developed for variants of Proposi- 
tional Dynamic Logics to address reasoning in expressive description logics 
with functionality restrictions on roles. In Section 5.4 we illustrate the ba- 
sic technique of reification for reasoning with expressive variants of number 
restrictions. In Section 5.5, we show how to reason with knowledge bases 
composed of a TBox and an ABox, and discuss extensions to deal with 
names (one-of construct). In Section 5.6, we introduce description logics 
with explicit fixpoint constructs, which are used to express in a natural way 
inductively and coinductively defined concepts. In Section 5.7, we study de- 
scription logics that include relations of arbitrary arity, which overcome the 
limitations of traditional description logics of modeling only binary links be- 
tween objects. This extension is particularly relevant for the application of 
description logics to databases. In Section 5.8, the problem of finite model 
reasoning in description logics is addressed. Indeed, for expressive description 
logics, reasoning w.r.t. finite models differs from reasoning w.r.t. unrestricted 
models, and requires specific methods. Finally, in Section 5.9, we discuss sev- 
eral extensions to description logics that lead in general to undecidability of 
the basic reasoning tasks. This shows that the expressive description logics 
considered in this chapter are close to the limit of undecidability, and are 
carefully designed in order to retain decidability. 
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5.2 Correspondence between Description Logics and 
Propositional Dynamic Logics 

In this section, we focus on expressive description logics that, besides the 
standard ACC constructs, include regular expressions over roles and possibly 
inverse roles [Baader, 1991; Schild, 1991]. It turns out that such description 
logics correspond directly to Propositional Dynamic Logics, which are modal 
logics used to express properties of programs. We first introduce the syntax 
and semantics of the description logics we consider, then introduce Proposi- 
tional Dynamic Logics, and finally discuss the correspondence between the 
two formalisms. 


5.2.1 Description Logics 

We consider the description logic ACCl reg , in which concepts and roles are 
formed according to the following syntax: 

C,C' — ► A I C I CnC' I Cue' I MR.C \ 3R.C 
R,R' — > P | R U R! | R o R! | R* | id(C) | R~ 

where A and P denote respectively atomic concepts and atomic roles, and 
C and R denote respectively arbitrary concepts and roles. 

In addition to the usual concept-forming constructs, ACCl reg provides 
constructs to form regular expressions over roles. Such constructs include 
role union , role composition, reflexive-transitive closure, and role identity. 
Their meaning is straightforward, except for role identity id{C) which, given 
a concept C, allows one to build a role which connects each instance of C 
to itself. As we shall see in the next section, there is a tight correspondence 
between these constructs and the operators on programs in Propositional 
Dynamic Logics. The presence in the language of the constructs for regular 
expressions is specified by the subscript “reg” in the name. 

ACClreg also includes the inverse role construct, which allows one to 
denote the inverse of a given relation. One can, for example, state with 
3ch ild . Doctor that someone has a parent who is a doctor, by making use of 
the inverse of the role child. It is worth noticing that, in a language without 
inverse of roles, in order to express such a constraint one must use two 
distinct roles (e.g., child and parent) that cannot be put in the proper relation 
to each other. We use the letter 1 in the name to specify the presence of 
inverse roles in a description logic; by dropping inverse roles from ACCl reg , 
we obtain the description logic ACC reg . 

From the semantic point of view, given an interpretation 1, concepts are 
interpreted as subsets of the domain A x , and roles as binary relations over 



196 


D. Calvanese and G. De Giacomo 


A x , as follows: 1 
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We consider the most general form of TBoxes constituted by general in- 
clusion axioms of the form C C C', without any restriction on cycles. We 
use C = C as an abbreviation for the pair of axioms C C C' and C' C C. 
We adopt the usual descriptive semantics for TBoxes (see Chapter 2). 

Example 5.1 The following ACCT reg TBox Tp e models a file system con- 
stituted by file-system elements (FSelem), each of which is either a Directory 
or a File. Each FSelem has a name, a Directory may have children while a File 
may not, and Root is a special directory which has no parent. The parent 
relationship is modeled through the inverse of the role child. 


FSelem 

C 

3name. String 

FSelem 


Directory U Fil 

Directory 

: 

-.File 

Directory 

c 

Vchild. FSelem 

File 

c 

Vchild.T 

Root 

c 

Directory 

Root 

c 

Vchild - . T. 


The axioms in Tp e imply that in a model every object connected by a chain 
of role child to an instance of Root is an instance of FSelem. Formally, Tp e \= 
3(child - )*.Root C FSelem. To verify that the implication holds, suppose that 

1 We use TZ* to denote the reflexive-transitive closure of the binary relation TZ, and TZi o 7?. 2 to 
denote the chaining of the binary relations 7Zi and 7^2- 
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there exists a model in which an instance o of 3(child~)*.Root is not an 
instance of FSelem. Then, reasoning by induction on the length of the chain 
from the instance of Root to o, one can derive a contradiction. Observe that 
induction is required, and hence such reasoning is not first-order. ■ 


In the following, when convenient, we assume, without loss of generality, 
that U and MR.C are expressed by means of n, and 3R.C. We also assume 
that the inverse operator is applied to atomic roles only. This can again be 
done without loss of generality, since the following equivalences hold: (R± o 
R- 2 )~ = Rf 0 Rf > (Ri U R- 2 ) - = R\ U Rf > (R*)~ = {R )*i and ( id(C))~ = 
id(C). 


5.2.2 Propositional Dynamic Logics 

Propositional Dynamic Logics (PDLs) are modal logics specifically devel- 
oped for reasoning about computer programs [Fischer and Ladner, 1979; 
Kozen and Tiuryn, 1990; Harel et al . , 2000]. In this subsection, we pro- 
vide a brief overview of PDLs, and illustrate the correspondence between 
description logics and PDLs. 

Syntactically, a PDL is constituted by expressions of two sorts: programs 
and formulae. Programs and formulae are built by starting from atomic pro- 
grams and propositional letters , and applying suitable operators. We denote 
propositional letters by A. arbitrary formulae by <p, atomic programs by P, 
and arbitrary programs by r, all possibly with subscripts. We focus on con- 
verse- pdl [Fischer and Ladner, 1979] which, as it turns out, corresponds to 
ACCl reg . The abstract syntax of converse - PDL is as follows: 

<p,(p' — >T | _L | A | <p/\<p' | <p\J cp' | -«f) | (r)cp | [r\<p 
r, r' — > P | rUr \ r\r' \ r* \ (pi \ r~ . 

The basic Propositional Dynamic Logic pdl [Fischer and Ladner, 1979] is 
obtained from converse - PDL by dropping converse programs r~ . 

The semantics of PDLs is based on the notion of (Kripke) structure, de- 
fined as a triple JA = (5,{7£p},II), where S denotes a non-empty set of 
states, {Rp} is a family of binary relations over S, each of which denotes 
the state transitions caused by an atomic program P, and II is a mapping 
from S to propositional letters such that II(s) determines the letters that 
are true in state s. The basic semantical relation is “a formula cp holds at a 
state s of a structure M ” , written A4,s j= <p, and is defined by induction on 
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the formation of 4>: 
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M, 
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nr 

for all s', (s, s') G 7Z r implies A4, s' j= (p 


where the family {IZp} is systematically extended so as to include, for ev- 
ery program r, the corresponding relation 7 Z r defined by induction on the 
formation of r: 


7Zp 

C S xS 


T^rUr' 

= 7Z r U 7Z r > 


P, rr i 

= 7Z r o 7 Z r > 


7Z r * 

= (n r y 



= {(s, s) G S x S \ M., s 

l=<^} 

7Z r - 

= {(si,s 2 ) G S x S | (s 2; 

, si) G 7Z r } 


If, for each atomic program P, the transition relation IZp is required to be 
a function that assigns to each state a unique successor state, then we are 
dealing with the deterministic variants of PDLs, namely dpdl and converse- 
dpdl [Ben-Ari et al. , 1982; Vardi and Wolper, 1986]. 

It is important to understand, given a formula </>, which are the formulae 
that play some role in establishing the truth-value of 0. In simpler modal 
logics, these formulae are simply all the subformulae of (j>, but due to the 
presence of reflexive-transitive closure this is not the case for PDLs. Such 
a set of formulae is given by the Fischer-Ladner closure of (f> [Fischer and 
Ladner, 1979]. 

To be concrete we now illustrate the Fischer-Ladner closure for converse- 
pdl. However, the notion of Fischer-Ladner closure can be easily extended 
to other PDLs. Let us assume, without loss of generality, that V and [•] 
are expressed by means of A, and (•). We also assume that the converse 
operator is applied to atomic programs only. This can again be done without 
loss of generality, since the following equivalences hold: (r U r')~ = r~ U r' _ , 
(r;r') _ = r'“;r _ , (r*)~ = (r - )*, and (</>?)~ = </>?. 
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The FischemLadner closure of a converse- pdl formula ip, denoted CL(ip), 
is the least set F such that ip £ F and such that: 

if (p & F then —up £ F (if (p is not of the form ~'(p') 

if ~>(p £ F then cp £ F 

if (p A (p 1 £ F then p,<p' £ F 

if {r)(p £ F then cp £ F 

if (r U r')<p £ F then {r)(p, (• r')(p £ F 

if (r‘,r')(p€F then (r)(r')<^ £ F 

if (r*)(p £ F then (r)(r*)(p € F 

if (c p'l)<p £ F then cp' £ F. 

Note that CL(ip ) includes all the subformulae of ip , but also formulae of the 
form {r)(r*)<p derived from ( r*)<p , which are in fact bigger than the formula 
they derive from. On the other hand, both the number and the size of the 
formulae in CL(ip ) are linearly bounded by the size of ip [Fischer and Ladner, 
1979], exactly like the set of subformulae. Note also that, by definition, if 
(p £ CL(ip), then CL(<p ) C CL (ip). 

A structure Ad = (S, {7£p},II) is called a model of a formula <p if there 
exists a state s £ S such that Ad, s |= <p. A formula cp is satisfiable if there 
exists a model of (p, otherwise the formula is unsatisfiable. A formula cp is 
valid in a structure Ad if for all s £ S, Ad, s \= <p. Formulae that are used to 
select the interpretations of interest are called axioms. Formally, a structure 
Ad is a model of an axiom cp, if cp is valid in Ad. A structure Ad is a model 
of a finite set of axioms T if Ad is a model of all axioms in T. An axiom is 
satisfiable if it has a model, and a finite set of axioms is satisfiable if it has 
a model. We say that a finite set T of axioms logically implies a formula (p, 
written F |= cp, if cp is valid in every model of F. 

It is easy to see that satisfiability of a formula (p as well as satisfiability of 
a finite set of axioms F can be reformulated by means of logical implication, 
as 0 \fi= -■ cp and T \f= _L respectively. 

Interestingly, logical implication can, in turn, be reformulated in terms 
of satisfiability, by making use of the following theorem (see [Kozen and 
Tiuryn, 1990]). 

Theorem 5.2 (Internalization of axioms) Let T be a finite set of 
converse-PDL axioms, and <p a converse-PDL formula. Then T \= <p if and 
only if the formula 

~-<P A [(Pi U ■ • • U P m U Pfi U • ■ ■ U P~)*}T ' 

is unsatisfiable, where Pi , ... , P m are all atomic programs occurring in T U 
{<p} and T' is the conjunction of all axioms in F. 
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Such a result exploits the power of program constructs (union, reflexive- 
transitive closure) and the connected model property (i.e., if a formula has 
a model, it has a model which is connected) of PDLs in order to represent 
axioms. The connected model property is typical of modal logics and it is 
enjoyed by all PDLs. As a consequence, a result analogous to Theorem 5.2 
holds for virtually all PDLs. 

Reasoning in PDLs has been thoroughly studied from the computational 
point of view, and the results for the PDLs considered here are summarized 
in the following theorem [Fischer and Ladner, 1979; Pratt, 1979; Ben-Ari 
et al. , 1982; Vardi and Wolper, 1986]: 

Theorem 5.3 Satisfiability in pdl is ExpTime -hard. Satisfiability in PDL, 
in converse-PDL, and in converse-DPDL can be decided in deterministic ex- 
ponential time. 


5.2.3 The correspondence 

The correspondence between description logics and PDLs was first pub- 
lished by Schild [l99l]. 2 In the work by Schild, it was shown that ACCI reg 
can be considered a notational variant of converse- pdl. This observation 
allowed the results on converse - pdl to be exploited for instantly closing 
long-standing issues regarding the decidability and complexity of both sat- 
isfiability and logical implication in ACC reg and ACCl reg . 3 The paper was 
very influential for the research in expressive description logics in the follow- 
ing decade, since thanks to the correspondence between PDLs and descrip- 
tion logics, first results but especially formal techniques and insights could 
be shared by the two communities. The correspondence between PDLs and 
description logics has been extensively used to study reasoning methods for 
expressive description logics. It has also led to a number of interesting ex- 
tensions of PDLs in terms of those constructs that are typical of description 
logics and have never been considered in PDLs. In particular, there is a 
tight relation between qualified number restrictions and graded modalities 
in modal logics [Van der Hoek, 1992; Van der Hoek and de Rijke, 1995; 
Fattorosi-Barnaba and De Caro, 1985; Fine, 1972]. 

The correspondence is based on the similarity between the interpretation 
structures of the two logics: at the extensional level, individuals (members 

2 In fact, the correspondence was first noticed by Levesque and Rosenschein at the beginning of 
the 1980s, but never published. In those days Levesque just used it in seminars to show the 
intractability of certain description logics. 

3 In fact, the decidability of ACCreg without the id(C) construct was independently established 
by Baader [1991]. 
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of A 1 ) in description logics correspond to states in PDLs, and links between 
two individuals correspond to state transitions. At the intensional level, 
concepts correspond to propositions, and roles correspond to programs. For- 
mally, the correspondence is realized through a one-to-one and onto mapping 
t from ACCl reg concepts to converse- PDL formulae, and from ACCT reg roles 
to converse- pdl programs. The mapping r is defined inductively as follows: 


r(A) 

= A 

r(P) 

= P 

rhC) 

= MC) 

r(R~) 

= r(R)~ 

t(C n C') 

= t(C) A t(C') 

t(R U R') 

= t(R) U t{R' 

t(C u C ) 

= r(C) V t{C') 

t(R o R!) 

II 

s 

§ 

t(VR.C ) 

= [r(R)]r{C) 

r(R*) 

= r(R)* 

t(3R.C) 

= (r(R))r(C) 

r(id(C)) 

= r(C)? 


Axioms in TBoxes of description logics correspond in the obvious way to 
axioms in PDLs. Moreover all forms of reasoning (satisfiability, logical im- 
plication, etc.) have their natural counterpart. 

One of the most important contributions of the correspondence is obtained 
by rephrasing Theorem 5.2 in terms of description logics. It says that every 
TBox can be “internalized” into a single concept, i.e., it is possible to build 
a concept that expresses all the axioms of the TBox. In doing so we rely 
on the ability to build a “universal” role, i.e., a role linking all individuals 
in a (connected) model. Indeed, a universal role can be expressed by using 
regular expressions over roles, and in particular the union of roles and the 
reflexive-transitive closure. The possibility of internalizing the TBox when 
dealing with expressive description logics tells us that for such description 
logics reasoning with TBoxes, i.e., logical implication, is no harder than 
reasoning with a single concept. 

Theorem 5.4 Concept satisfiability and logical implication in ACC reg are 
ExpTlME-hard. Concept satisfiability and logical implication in ACC reg and 
ACCT reg can be decided in deterministic exponential time. 

Observe that for description logics that do not allow the expression of 
a universal role, there is a sharp difference between reasoning techniques 
used in the presence of TBoxes, and techniques used to reason on concept 
expressions. The profound difference is reflected by the computational prop- 
erties of the associated decision problems. For example, the logic AC admits 
simple structural algorithms for deciding reasoning tasks not involving ax- 
ioms, and these algorithms are sound and complete and work in polynomial 
time. However, if general inclusion axioms are considered, then reasoning 
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becomes ExpTlME-complete (see Chapter 3), and the decision procedures 
that have been developed include suitable termination strategies [Buchheit 
et al . , 1993a]. Similarly, for the more expressive logic ACC, reasoning tasks 
not involving a TBox are PSPACE-complete [Schmidt-Schaufi and Srnolka, 
1991], while those that do involve one are ExpTlME-complete. 


5.3 Functional restrictions 

We have seen that the logics ACC reg and ACCX reg correspond to standard 
PDL and converse- pdl respectively, which are both well-studied. In this sec- 
tion we show how the correspondence can also be used to deal with con- 
structs that are typical of description logics, namely functional restrictions, 
by exploiting techniques developed for reasoning in PDLs. In particular, we 
will adopt automata-based techniques, which have been very successful in 
studying reasoning for expressive variants of PDL and characterizing their 
complexity. 

Functional restrictions are the simplest form of number restrictions con- 
sidered in description logics, and allow one to specify local functionality of 
roles, i.e. , that instances of certain concepts have unique role fillers for a 
given role. By adding functional restrictions on atomic roles and their in- 
verse to ACCl reg , we obtain the Description Logic ACCJ~X reg . The PDL 
corresponding to ACCJ-X reg is a PDL that extends converse- dpdl [Vardi 
and Wolper, 1986] with determinism of both atomic programs and their in- 
verse, and such that determinism is no longer a global property, but one 
that can be imposed locally. 

Formally, ACCTX reg is obtained from ACCX reg by adding functional re- 
strictions of the form ^ 1 Q, where Q is a basic role, i.e., either an atomic role 
or the inverse of an atomic role. Such a functional restriction is interpreted 
as follows: 

(< 1 Q) 1 = { 0 gA x ||{ 0 / gA x |( 0 , 0 / )gQ j }|<1}. 

We show that reasoning in ACCTX reg is in ExpTime, and, since reasoning 
in ACC reg is already ExpTiME-hard, is in fact ExpTlME-complete. With- 
out loss of generality we concentrate on concept satisfiability. We exploit 
the fact that ACCFX reg has the tree model property, which states that if an 
ACCFX reg concept C is satisfiable then it is satisfied in an interpretation 
which has the structure of a (possibly infinite) tree with bounded branching 
degree (see later). This allows us to make use of techniques based on au- 
tomata on infinite trees. In particular, we make use of two-way alternating 
automata on infinite trees (2ATAs) introduced by Vardi [1998]. 2ATAs were 
used by Vardi [1998] to derive a decision procedure for modal ^.-calculus 
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with backward modalities. We first introduce 2ATAs and then show how 
they can be used to reason in ACCFT reg . 


5.3.1 Automata on infinite trees 

Infinite trees are represented as prefix-closed (infinite) sets of words over 
N (the set of positive natural numbers). Formally, an infinite tree is a set 
of words T C N*, such that if x-c £ T, where x £ N* and c £ N, then also 
x £ T. The elements of T are called nodes, the empty word e is the root of 
T, and for every x £ T, the nodes x-c, with c £ N, are the successors of x. 
By convention we take x-0 = x, and x-i- — 1 = x. The branching degree d(x) 
of a node x denotes the number of successors of x. If the branching degree 
of all nodes of a tree is bounded by k, we say that the tree has branching 
degree k. An infinite path P of T is a prefix-closed set P C T such that for 
every i > 0 there exists a unique node x £ P with \x\ = i. A labeled tree over 
an alphabet E is a pair (T, V), where T is a tree and V : T — ► E maps each 
node of T to an element of E. 

Alternating automata on infinite trees are a generalization of nondeter- 
ministic automata on infinite trees, introduced by Muller and Schupp [1987]. 
They allow an elegant reduction of decision problems for temporal and pro- 
gram logics [Emerson and Jutla, 1991; Bernholtz et al, 1994]. Let 13(1) be 
the set of positive Boolean formulae over I, built inductively by applying A 
and V starting from true, false, and elements of I. For a set J C I and a 
formula (p £ 13(1), we say that J satisfies p if and only if assigning true to 
the elements in J and false to those in I\J makes p true. For a positive 
integer k, let [k] = {—1,0, 1, . . . , k}. A two-way alternating automaton over 
infinite trees with branching degree k is a tuple A = (E, Q, S, c/o , P), where 
E is the input alphabet, Q is a finite set of states, 5 : Q xE — ► B([k\ x Q) 
is the transition function, qo £ Q is the initial state, and F specifies the 
acceptance condition. 

The transition function maps a state q £ Q and an input letter a £ E to a 
positive Boolean formula over [k] x Q. Intuitively, if 6(q,a) = p, then each 
pair (c, q') appearing in p corresponds to a new copy of the automaton going 
to the direction suggested by c and starting in state q' . For example, if k = 2 
and 6(qi, a) = ((1, qfi) A (1, <73)) V (( — 1, q±) A (0, q-fi), when the automaton is 
in the state q\ and is reading the node x labeled by the letter a, it proceeds 
either by sending off two copies, in the states q 2 and q 3 respectively, to the 
first successor of x (i.e. , x-1), or by sending off one copy in the state q\ to the 
predecessor of x (i.e., x-—l) and one copy in the state q% to x itself (i.e., x-0). 

A run of a 2ATA A over a labeled tree (T, V) is a labeled tree (T r ,r) in 
which every node is labeled by an element of T x Q. A node in T r labeled 



204 


D. Calvanese and G. De Giacomo 


by (x, q ) describes a copy of A that is in the state q and reads the node x 
of T. The labels of adjacent nodes have to satisfy the transition function of 
A. Formally, a run ( T r ,r ) is a T x Q-labeled tree satisfying: 

1. £ G T r and r(s) = (s, go). 

2. Let y € T r , with r(y ) = (x,q) and S(q,V(x)) = <p. Then there is a (possibly 
empty) set S = {(ci, gi), . . . , (c„, q n )} C [k] x Q such that: 

• S satisfies <p and 

• for all 1 < i < n, we have that y-i G T r , x-Ci is defined, and r(y-i ) = (a ;-Cj, qf). 

A run (T r , r) is accepting if all its infinite paths satisfy the acceptance 
condition. 4 Given an infinite path P C T r , let inf(P) C Q be the set of states 
that appear infinitely often in P (as second components of node labels). We 
consider here Biichi acceptance conditions. A Biichi condition over a state 
set Q is a subset F of Q, and an infinite path P satisfies F if inf(P) n F / 0. 

The non-emptiness problem for 2ATAs consists in determining, whether a 
given 2ATA accepts a nonempty set of trees. The results by Vardi [1998] pro- 
vide the following complexity characterization of non-emptiness of 2ATAs. 

Theorem 5.5 ([Vardi, 1998]) Given a 2 AT A A with n states and an 
input alphabet with m elements, deciding non-emptiness of A can be done 
in time exponential in n and polynomial in m. 


5.3.2 Reasoning in ACCFT reg 


The (Fischer-Ladner) closure for ACCFT reg extends immediately the anal- 
ogous notion for converse- pdl (see Subsection 5.2.2), treating functional 
restrictions as atomic concepts. In particular, the closure CL(Cq) of an 
ACCFTreg concept Co is defined as the smallest set of concepts such that 
Co € CL(Cq) and such that (assuming U and V to be expressed by means of 
fl and 3, and the inverse operator applied only to atomic roles):' 5 


if Ce CL{C 0 ) then 

if -iG G C'L(C'o) then 

if C n C' G C'L(C'o) then 

if 3R.C G CL(C 0 ) then 

if 3(12 U R').C G CL(C 0 ) then 
if 3(RoR').C £ CL(C 0 ) then 
if 3R*.CeCL(C 0 ) then 

if 3id{C).C' G C'L(C'o) then 


G CL(Cq) (if C is not of the form -i C') 
C G C'L(C'o) 

C, C' G C'L(C'o) 

C G CL{C 0 ) 

3 R.C, 3 R'.C G CL(C 0 ) 

3R.3R'.C G CL(C 0 ) 

3R.3RTC G CL(C 0 ) 

C £ CL{C 0 ). 


The cardinality of CL(C'q) is linear in the length of Co- 


4 No condition is imposed on the finite paths of the run. 

5 We recall that C and C' stand for arbitrary concepts, and R and R' stand for arbitrary roles. 
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It can be shown, following the lines of the proof in [Vardi and Wolper, 
1986] for converse- dpdl, that ACCPT reg enjoys the tree model property , 
i.e. , every satisfiable concept has a model that has the structure of a (possi- 
bly infinite) tree with branching degree linearly bounded by the size of the 
concept. More precisely, we have the following result. 

Theorem 5.6 Every satisfiable ACCFT reg concept Co has a tree model with 
branching degree kc 0 equal to twice the number of elements of CL(Cq). 

This property allows us to check satisfiability of an ACCET reg concept 
Co by building a 2ATA that accepts the (labeled) trees that correspond to 
tree models of Co- Let A be the set of atomic concepts appearing in Co, and 
B = {Qi, • • • , Q n } the set of atomic roles appearing in Co and their inverses. 
We construct from Cq a 2ATA Af 0 that checks that Co is satisfied at the 
root of the input tree. We represent in each node of the tree the information 
about which atomic concepts are true in the node, and about the basic role 
that connects the predecessor of the node to the node itself (except for the 
root). More precisely, we label each node with a pair a = ( a,q ), where a is 
the set of atomic concepts that are true in the node, and q = Q if the node 
is reached from its predecessor through the basic role Q. That is, if Q stands 
for an atomic role P, then the node is reached from its predecessor through 
P, and if Q stands for P~ , then the predecessor is reached from the node 
through P. In the root, q = Pdum, where Pdum is a new symbol representing 
a dummy role. 

Given an ACCPl reg concept Co, we construct an automaton A<y 0 that 
accepts trees that correspond to tree models of Co- For technical reasons, it 
is convenient to consider concepts in negation normal form (i.e., negations 
are pushed inside as much as possible) . It is easy to check that the transfor- 
mation of a concept into negation normal form can be performed in linear 
time in the size of the concept. Below, we denote by nnf(C) the negation 
normal form of C, and by CL nn f(Cfi) the set { nnf(C ) | C G CL(Cq)}. The 
automaton A<y 0 = (£, S, 6, Si n i, F) is defined as follows: 

• The alphabet is £ = 2 A x (B U {Pdum}), i.e., the set of pairs whose first compo- 
nent is a set of atomic concepts, and whose second component is a basic role or 
the dummy role Pdum- This corresponds to labeling each node of the tree with 
a truth assignment to the atomic concepts, and with the role used to reach the 
node from its predecessor. 

• The set of states is S = U CL nn f{Co) U {Q,^Q \ Q G £>}, where s™ is the 
initial state, CL nn f(Co) is the set of concepts (in negation normal form) in the 
closure of Co, and {Q, ~^Q \ Q G B} are states used to check whether a basic role 
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labels a node. Intuitively, when the automaton in a state C £ CL nn f(Co) visits 
a node x of the tree, this means that the automaton has to check that C holds 
in x. 

• The transition function S is defined as follows: 

1. For each a £ 2- 4 , there is a transition from the initial state 

[ot, Pdum)') — (0, nnf (C-q) ) • 


Such a transition checks that the root of the tree is labeled with the dummy 
role Pdumi and moves to the state that verifies Co in the root itself. 

2. For each ( a,q ) £ E and each atomic concept A £ A, there are transitions 


(«,?)) 
6(-u4, (a,q)) 


( true, if A £ a 
\ false, if A a 
J true, if A ^ a 
\ false, if A £ a. 


Such transitions check the truth value of atomic concepts and their negations 
in the current node of the tree. 

3. For each (a,q) £ E and each basic role Q £ B, there are transitions 


S(Q,(a,q)) 

6(~>Q,(a,q)) 


J true, if q = Q 
\ false, if q ^ Q 
J true, if q ^ Q 
\ false, if q = Q. 


Such transitions check through which role the current node is reached. 

4. For the concepts in CL nn f(Co) and each a £ S, there are transitions 


6(CnC',a) = (0,C) A (0,C') 

6{CuC,a) = (0,(7) V (0,C') 

S(VQ.C,a) = ((0, ->Q~) V (—1, C)) 

^ Ai<i<fc Co ((i,-Q) V(*,(7)) 


5(\/(RuR').C,a) 
(5(V(f?of?').C,cr) 
S(WR*.C, a) 
S(Wid(C).C' , a) 
5(3Q.C, a) 
6(3{RuR').C,a) 
S(3(R of?') .C,a) 
S(3R*.C, a) 
S(3id(C).C',a) 


(0 ,Vf?.C) A (0,Vf?'.C) 

(0 ,VR.VR'.C) 

(0,(7) A (0 ,\/R.VR*.C) 

(0, nnf V (0, C') 

((0, Q~) A (—1, C)) V Vi <i<k Co ((*,Q) A(*,(7)) 
(0, 3R.C) V (0,3 R'.C) 

(0 ,3R.3R'.C) 

(0,(7) V (0,3R.3R* .C) 

(0,(7) A (0,(70. 


All such transitions, except for those involving VR* .C and 3 R*.C, inductively 
decompose concepts and roles, and move to appropriate states of the automa- 
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ton and nodes of the tree. The transitions involving MR* .C treat MR* .C as the 
equivalent concept C n MR.MR*.C, and the transitions involving 3 R* .C treat 
3R*.C as the equivalent concept C U 3R.3R* .C. 

5. For each concept of the form ^ 1 Q in CL nn f(C ) and each a £ £, there is a 
transition 

<5«lQ,cr) = ((0, Q~) A l\ l < i < kco {i,^Q))M 

((0 ,^Q~) A Ai<i<j<; SCo ((A^ ( 5) v (R~ i Q)))- 

Such transitions check that, for a node x labeled with ^ IQ, there exists at 
most one node (among the predecessor and the successors of x) reachable from 
x through Q. 

6. For each concept of the form 1 Q in CL nn f(C) and each a £ S, there is a 
transition 


<5(-<lQ,cr) = ((0, Q ) A Vi<*<fc O0 (*»Q)) v 

V 1 <i<j<kc 0 ((*> Q) ^ (A Q))- 

Such transitions check that, for a node x labeled with -i^ 1 Q , there exist at 
least two nodes (among the predecessor and the successors of x) reachable from 
x through Q. 

• The set F of final states is the set of concepts in CL nn f(C 0 ) of the form MR*.C. 
Observe that concepts of the form 3R*.C are not final states, and this is suffi- 
cient to guarantee that such concepts are satisfied in all accepting runs of the 
automaton. 

A run of the automaton A c 0 on an infinite tree starts in the root, checking 
that Cq holds there (item 1 above). It does so by inductively decomposing 
nnf (Co) while appropriately navigating the tree (items 3 and 4) until it 
arrives at atomic concepts, functional restrictions, and their negations. These 
are checked locally (items 2, 5, and 6). Concepts of the form MR* .C and 
3R* .C are propagated using the equivalent concepts C n MR. MR* .C and C LI 
3R.3R*.C, respectively. It is only the propagation of such concepts that 
may generate infinite branches in a run. Now, a run of the automaton may 
contain an infinite branch in which 3R* .C is always resolved by choosing 
the disjunct 3R.3R* .C , without ever choosing the disjunct C. This infinite 
branch in the run corresponds to an infinite path in the tree where R is 
iterated forever and in which C is never fulfilled. However, the semantics of 
3R* .C requires that C is fulfilled after a finite number of iterations of R. 
Hence such an infinite path cannot be used to satisfy 3R*.C. The acceptance 
condition of the automaton, which requires that each infinite branch in a 
run contains a state of the form MR* .C, rules out such infinite branches 
in accepting runs. Indeed, a run always deferring the fulfillment of C will 
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contain an infinite branch where all states have the form 3R±. ■ ■ ■ 3R n .3R* .C , 
with n > 0 and R± o ■ ■ ■ o R, n a postfix of R. Observe that the only remaining 
infinite branches in a run are those that arise by propagating concepts of 
the form MR* .C indefinitely often. The acceptance condition allows for such 
branches. 

Given a labeled tree T = (T, V) accepted by Ap 0 , we define an interpreta- 
tion Ip = (A 1 , - x ) as follows. First, we define for each atomic role P, a rela- 
tion 7 Zp as follows: IZp = {(x, xi ) | xi £ T and V (xi) = (a, P ) for some a £ 
2-^} U {(xi,x) | xi £ T and V(xi ) = ( a,P ~ ) for some a £ 2^}. Then, using 
such relations, we define: 

• A 1 = { x | (e, x) £ (\J P (Rp U Up))* }; 

• A x = A 1 fl {cc | V{x) = (a, q) and A £ a, for some a £ 2 A and q £ BU 
{ P dum } }, for each atomic concept A- 

• P x = (A 1 x A 1 ) fl 7 Zp, for each atomic role P. 

Lemma 5.7 If a labeled tree T is accepted by A c 0 , then 2p is a model of 

C 0 . 

Conversely, given a tree model X of Co with branching degree kc 0 , we can 
obtain a labeled tree Tj = (T, V ) (with branching degree £:c 0 ) as follows: 

• T= A x ; 

• V(e) = (a, Pdum), where a = {T | e £ A J }\ 

• V(xi) = (a, Q), where a = {A \ xi £ A 1 } and (x,xi) £ Q x . 

Lemma 5.8 If I is a tree model of Co with branching degree kc 0 , then Tp 
is a labeled tree accepted by A c 0 ■ 

From the lemmas above and the tree model property of ACCPZ reg (The- 
orem 5.6), we get the following result. 

Theorem 5.9 An ACCPT reg concept Cq is satisfiable if and only if the set 
of trees accepted by A c 0 is not empty. 

From this theorem, it follows that we can use algorithms for non-enrptiness 
of 2ATAs to check satisfiability in ACCPl reg . It turns out that such a deci- 
sion procedure is indeed optimal w.r.t. the computational complexity. The 
2ATA A c 0 has a number of states that is linear in the size of Co, while the 
alphabet is exponential in the number of atomic concepts occurring in Cq. 



Expressive Description Logics 


209 


By Theorem 5.5 we get an upper bound for reasoning in ACCTI reg that 
matches the ExpTime lower bound. 

Theorem 5.10 Concept satisfiability (and hence logical implication) in 
ACCTX reg is ExpTime- complete. 

Functional restrictions, in the context of expressive description logics 
that include inverse roles and TBox axioms, were originally studied in 
[De Giacomo and Lenzerini, 1994a; De Giacomo, 1995] using the so called 
axiom schema instantiation technique. The technique is based on the idea 
of devising an axiom schema corresponding to the property of interest (e.g., 
functional restrictions) and instantiating such a schema to a finite (poly- 
nomial) number of concepts. A nice illustration of this technique is the 
reduction of converse- PDL to pdl in [De Giacomo, 1996]. Axiom schema 
instantiation can be used to show that reasoning w.r.t. TBoxes is Exp- 
TiME-complete in significant subcases of ACCTI reg (such as reasoning w.r.t. 
ACCTI TBoxes [Calvanese et al. , 2001b]). However, it is still open whether 
it can be applied to show ExpTiME-completeness of ACCTI reg . The at- 
tempt in this direction presented in [De Giacomo and Lenzerini, 1994a; 
De Giacomo, 1995] turned out to be incomplete [Zakharyaschev, 2000]. 

5.4 Qualified number restrictions 

Next we deal with qualified number restrictions, which are the most gen- 
eral form of number restrictions, and allow one to specify arbitrary car- 
dinality constraints on roles with role fillers belonging to a certain con- 
cept. In particular we will consider qualified number restrictions on basic 
roles, i.e., atomic roles and their inverse. By adding such constructs to 
ACCI reg we obtain the description logic ACCQIreg- The PDL correspond- 
ing to ACCQXreg is an extension of converse- pdl with “graded modalities” 
[Fattorosi-Barnaba and De Caro, 1985; Van der Hoek and de Rijke, 1995; 
Tobies, 1999c] on atomic programs and their converse. 

Formally, ACCQIreg is obtained from ACCI reg by adding qualified number 
restrictions of the form ^ nQ.C and fi-nQ.C, where n is a nonnegative 
integer, Q is a basic role, and C is an ACCQI reg concept. Such constructs 
are interpreted as follows: 

(^ nQ.C ) T = {o e A x | | {o' e A x | (o, o') g Q 1 a o' e C' x }| < n} 
(i^nQ.C) 1 = {o G A 1 | | {o' G A 1 | (o, o') G Q 1 A o' G C T }\ > n}. 

Reasoning in ACCQI reg is still ExpTlME-complete under the standard 
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assumption in description logics, that numbers in number restrictions are 
represented in unary. 6 This could be shown by extending the automata- 
theoretic techniques introduced in Section 5.3 to deal also with qualified 
number restrictions. Here we take a different approach and study reason- 
ing in ACCQLreg by exhibiting a reduction from ACC QI reg to ACCTT reg 
[De Giacomo and Lenzerini, 1995; De Giacomo, 1995]. Since the reduction 
is polynomial, we get as a result ExpTiME-completeness of ACCQXreg ■ The 
reduction is based on the notion of reification. Such a notion plays a major 
role in dealing with Boolean combinations of (atomic) roles [De Giacomo 
and Lenzerini, 1995; 1994c], as well as in extending expressive Description 
Logics with relations of arbitrary arity (see Section 5.7). 


5-4 -1 Reification of roles 

Atomic roles are interpreted as binary relations. Reifying a binary relation 
means creating for each pair of individuals (oi, 02 ) in the relation an individ- 
ual which is connected by means of two special roles V\ and V 2 to o\ and 02 , 
respectively. The set of such individuals represents the set of pairs forming 
the relation. However, the following problem arises: in general, there may 
be two or more individuals all connected by means of V± and V 2 to cq and 
02 respectively, and thus all representing the same pair ( 01 , 02 ). Obviously, 
in order to have a correct representation of a relation, such a situation must 
be avoided. 

Given an atomic role P, we define its reified form to be the role 

Vf o id(Ap) o V 2 

where dp is a new atomic concept denoting individuals representing the 
tuples of the relation associated with P, and V\ and V 2 denote two functional 
roles that connect each individual in dp respectively to the first and the 
second component of the tuple represented by the individual. Observe that 
there is a clear symmetry between the role Vf~ o id(Ap) o V 2 and its inverse 
V 2 ~ o id(Ap) o V\. 

Definition 5.11 Let C be an ACCQZ reg concept. The reified counterpart 
£i(C) of C is the conjunction of two concepts, £i(C) = fo(C) n ©i, where: 

• £0 (C) is obtained from the original concept C by (i) replacing every atomic role 
P by the complex role V{~ o id(Ap) o V 2 , where Vj and V 2 are new atomic roles 

6 In [Tobies, 2001a] techniques for dealing with qualified number restrictions with numbers coded 
in binary are presented, and are used to show that even under this assumption reasoning over 
ACCQX knowledge bases can be done in ExpTime. 
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(the only ones present after the transformation) and Ap is a new atomic concept; 
(ii) and then re-expressing every qualified number restriction 

< n (Vj - o id(Ap) o Vf)-D as < n Vj _ .(.Ap n 3Vi.D) 

> n (Vj - o id(Ap) o Vf)-D as ^ n Vj - .(.Ap n XV^.D) 

< n (V 2 - o id(Ap) o V\).D as ^ nV 2 ~ -{Ap n 3Vj.Il) 

> n (Vf o id(Ap) o Vi).D as ^ nV 2 ~ -{Ap n 3Vj.Il) 

• 0i = v(Vi uv 2 u Vf u v 2 - )*. « 1 vi n < 1 v 2 ). 

The next theorem guarantees that, without loss of generality, we can re- 
strict our attention to models of £i(C) that correctly represent relations 
associated with atomic roles, i.e. , models in which each tuple of such rela- 
tions is represented by a single individual. 

Theorem 5.12 If the concept £1 (C) has a model X then it has a model 
X' such that for each (o, o') G (Vj - o id(Ap.) o V 2 ) 1 ' there is exactly one in- 
dividual Ooo' such that (cw,o) G V x and (cw,c/) G V X ■ That is, for all 
01 , 02 , o, o' G A x such that o\ / 02 and o / o', the following condition holds: 

o\ , 02 G Ap i D ~ '((oi , o) G l'i A (02 , o) G 1 1 A (01 , o ) G Vj> A (02 , o ) G Vj ) • 

The proof of Theorem 5.12 exploits the disjoint union model property: let C 
be an ACCQXreg concept and X = (A x , - 1 ) and J = (A^, be two models 
of C, then also the interpretation X l+J J = (CX l±l CX , - x l±l A), which is the 
disjoint union of X and J , is a model of C. We remark that most description 
logics have such a property, which is, in fact, typical of modal logics. Without 
going into details, we just mention that the model X' is constructed from X 
as the disjoint union of several copies of X, in which the extension of role Vj is 
modified by exchanging, in those instances that cause a wrong representation 
of a role, the second component with a corresponding individual in one of 
the copies of X. 

By using Theorem 5.12 we can prove the result below. 

Theorem 5.13 An ACCQXreg concept C is satisfiable if and only if its 
reified counterpart is satisfiable. 


5-4-2 Reducing ACCQXreg to ACCXX reg 

By Theorem 5.13, we can concentrate on the reified counterparts of 
ACCQXreg concepts. Note that these are themselves ACCQXreg concepts, 
but their special form allows us to convert them into ACCXX reg concepts. 
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Intuitively, we represent the role V~ , i = 1,2 (recall that V) is functional 
while V is not), by the role Fy o Fy , where Fy. and Fy are new functional 
roles. 7 The main point of such a transformation is that it is easy to express 
qualified number restrictions as constraints on the chain of (Fy. o Fy * )- 
successors of an individual. Formally, we define the A£CXX re9 -counterpart 
of an ACCQI re g concept as follows. 

Definition 5.14 Let C be an ACCQIreg concept and £i(C) = £o(C) n ©i 
its reified counterpart. The ACC IX reg - counterpart £ 2 (C) of C is the con- 
junction of two concepts, £ 2 (C) = £o(C) A @ 2 , where: 

* to(Q is obtained from £ 0 (C) by simultaneously replacing: 8 

— every occurrence of role V) in constructs different from qualified number restric- 
tions by (Fy o Fy*)~ , where Fy and Fy are new atomic roles; 

— every ^nV~ .D by V(Fy o Fy.* o (id(D) o Fy. + ) n ).^D\ 

— every ^nV~ .D by 3 (F Vi o Fy.* o ( id(D ) o Fy. + ) n ~ 1 ).D. 

• ©2 = V(Uj =1]2 (FV i U Fy. U Fy. U Fy.~))* -(Oi n 6*2), with 9 i of the form: 

<1 F Vi n<i F'y. n<i Fy. n<i F' v r n ^(3 F^.t n 3F{,~.T). 

Observe that 0 2 constrains each model X of £ 2 (C) so that the relations 
X^, F (Fy) 1 , and (F{,~f are partial functions, and each individual 
cannot be linked to other individuals by both (Fy) 1 and (Fy~) x . As a 
consequence, we get that ((Fy o Fy *)~) x is a partial function. This allows 
us to reconstruct the extension of V), as required. 

We illustrate the basic relationships between a model of an ACC QI reg 
concept and the models of its reified counterpart and A£CX'X reff -counterpart 
by means of an example. 

Example 5.15 Consider the concept 

C 0 = 3P.(= 2 P~ .(= 2 P.T)) 

and consider the model X of Co depicted in Figure 5.1, in which a 6 C x . 
Such a model corresponds to a model I of the reified counterpart £i(Co) of 
Co, shown in Figure 5.2. The model I of £i(Co) in turn corresponds to a 
model X" of the A£CX'X reg -counterpart £ 2 (Co) of Co, shown in Figure 5.3. 
Notice that from X" we can easily reconstruct X', and from X' the model X 
of the original concept. ■ 

7 The idea of expressing nonfunctional roles by means of chains of functional roles is due to 
Parikh [l98l], who used it to reduce standard pdl to dpdl. 

8 Here R+ stands for R o R* and R n stands for R o • • • o R (n times). 
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a b 



Fig. 5.1. A model of the ACCQL reg concept C 0 = 3P.(=2P .(=2P.T)). 



Fig. 5.2. A model of the reified counterpart £i(Co) of Co- 


lt can be shown that £i((7) is satisfiable if and only if ^(C) is satisfiable. 
Since, as it is easy to see, the size of (, 2 (C) is polynomial in the size of C, 
we get the following characterization of the computational complexity of 
reasoning in ACCQZ reg . 


Theorem 5.16 Concept satisfiability (and hence logical implication) in 
ACCQIreg is ExpTlME-comp/efe. 


5.5 Objects 

In this section, we review results involving knowledge about individuals ex- 
pressed in terms of membership assertions. Given an alphabet O of symbols 
for individuals, a (membership) assertion has one of the following forms: 

C(a) P(ai,a 2 ) 

where C is a concept, P is an atomic role, and a, ai, a 2 belong to O. 
An interpretation X is extended so as to assign to each a 6 O an element 
a 1 £ j n 3^^ a wa y that the unique name assumption is satisfied, i.e. , 
different elements are assigned to different symbols in O. X satisfies C(a) if 
a 1 6 C 1 , and X satisfies P(a\,a 2 ) if (af,af) G P x . An ABox A is a finite 
set of membership assertions, and an interpretation X is called a model of 
A if X satisfies every assertion in A. 

A knowledge base is a pair K, = ( T,A ), where X is a TBox, and A is 
an ABox. An interpretation X is called a model of 1C if it is a model of 
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Fig. 5.3. A model of the ACCT X reg -counterpart £ 2 (Co) of C 0 . 

both T and A. tC is satisfiable if it has a model, and fC logically implies 
an assertion (3, denoted 1C |= f3, where /3 is either an inclusion or a mem- 
bership assertion, if every model of 1C satisfies (3. Logical implication can 
be reformulated in terms of unsatisfiability: e.g., 1C |= C(a) iff /C U {-iC(a)} 
is unsatisfiable; similarly 1C |= C\ C C 2 iff K, U {(Ci n -iC^Xa 7 )} is unsatisfi- 
able, where a ' does not occur in 1C. Therefore, we only need a procedure for 
checking satisfiability of a knowledge base. 

Next we illustrate the technique for reasoning on ACCQfL reg knowledge 
bases [De Giacomo and Lenzerini, 1996]. The basic idea is as follows: check- 
ing the satisfiability of an ACC QT reg knowledge base 1C = (T, A) is poly- 
nomially reduced to checking the satisfiability of an ACCQfL reg knowledge 
base 1C' = (T' , A'), whose ABox A! is made of a single membership assertion 
of the form C(a). In other words, the satisfiability of K, is reduced to the 
satisfiability of the concept C w.r.t. the TBox T' of the resulting knowledge 
base. The latter reasoning service can be realized by means of the method 
presented in Section 5.4, and, as we have seen, is ExpTiME-complete. Thus, 
by means of the reduction, we get an ExpTime algorithm for satisfiability 
of ACCQZreg knowledge bases, and hence for all standard reasoning services 
on ACCQXreg knowledge bases. 

Definition 5.17 Let 1C = (T,A) be an ACC Ql r eg knowledge base. The 
reduced form of 1C is the ACCQfL reg knowledge base Kf = (T', A') defined as 
follows. We introduce a new atomic role create, and also for each individual 
a*, i = 1, . . » m, occurring in A, a new atomic concept A, . Then: 

A ' = {(Bcreate.Ai n • • • n 3 create. A m )(g)}, 

where g is a new individual (the only one present in A'), and T' = T UT^U 
T aux , where: 

• 7(4 is constituted by the following inclusion axioms: 

— for each membership assertion C{af) £ A, one inclusion axiom 


Ai G C 
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— for each membership assertion P(a,i,cij) £ A, two inclusion axioms 

Ai C 3 P.Aj n < 1 P.Aj 
Aj c a p- .Ai n < 1 p~ .Ai 

— for each pair of distinct individuals Oj and aj occurring in A, one inclusion 
axiom 


Ai C ~^Aj 

• %lux is constituted by one inclusion axiom ( U stands for (Pi U • • • U P n U Pf U 
• • • U P~ )*, where Pi, . . . ,P n are all atomic roles in T U T4): 

AiUC C MU.^AiUC) 

for each A, occurring in T U T4 and each C £ CL ext (T U 7(4), where CL ext (T U 
7(4) is a suitably extended syntactic closure 9 of T U 7(4 whose size is polynomially 
related to the size of T UT4 [De Giacomo and Lenzerini, 1996 ]. 

To understand how the reduced form KL' = {T ' , A') relates to the original 
knowledge base fC = ( T, A ), first observe that the ABox A ' is used to force 
the existence of the only individual g connected by the role create to one 
instance of each A*. It can be shown that this allows us to restrict our 
attention to models of K,' that represent a graph connected to g, i.e., models 
1 = (A x , • x ) of K,' such that A 1 = {5} U {s' | ( g , s') £ create 1 o (\J P (P I U 

P 1 -)*)}. 

The TBox T' consists of three parts T, and T aux . T is the original 
inclusion axioms. 7(4 is what we may call a “naive encoding” of the original 
ABox A as inclusion axioms. Indeed, each individual ai is represented in 
7(4 as a new atomic concept A* (disjoint from the other A^’s), and the 
membership assertions in the original ABox A are represented as inclusion 
axioms in 7(4 involving such new atomic concepts. However T U 7(4 alone 
does not suffice to represent faithfully (w.r.t. the reasoning services we are 
interested in) the original knowledge base, because an individual a* in K, 
is represented by the set of instances of A, in K! . In order to reduce the 
satisfiability of KJ to the satisfiability of 1 C, we must be able to single out, 
for each Aj, one instance of Aj representative of a t . For this purpose, we need 
to include in T a new part, called T aU x , which contains inclusion axioms of 
the form 


Aj n C* c VZ7.(-iAj u C). 


9 The syntactic closure of a TBox is the syntactic closure of the concept obtained by internalizing 
the axioms of the TBox. 
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Intuitively, such axioms say that, if one instance of A t is also an instance of 
C, then every instance of A t is an instance of C. Observe that, if we could 
add an infinite set of axioms of this form, one for each possible concept of 
the language (i.e., an axiom schema), we could safely restrict our attention 
to models of 1C ' with just one instance for every concept Ai, since there 
would be no way in the logic to distinguish two instances of Ai one from the 
other. What is shown by De Giacomo and Lenzerini [1996] is that in fact 
we do need only a polynomial number of such inclusion axioms (as specified 
by Taux) i n order to be able to identify, for each i, an instance of Ai as 
representative of ai. This allows us to prove that the existence of a model of 
K! implies the existence of a model of 1C. 

Theorem 5.18 Knowledge base satisfiability (and hence every standard rea- 
soning service) in ACCQI reg is ExpTiME-comp/ete. 

Using a similar approach, De Giacomo and Lenzerini [1994a] and 
De Giacomo [1995] extend ACCQ reg and ACCX reg by adding special atomic 
concepts A a , called nominals , having exactly one single instance a, i.e., the 
individual they name. Nominals may occur in concepts exactly as atomic 
concepts, and hence they constitute one of the most flexible ways to express 
knowledge about single individuals. 

By using nominals we can capture the “one-of” construct, having the 
form {ai,...,a n }, denoting the concept made of exactly the enumerated 
individuals a \, . . . , a n . 10 We can also capture the “fills” construct, having the 
form R : a, denoting those individuals having the individual a as a role filler 
of R. 11 (See [Schaerf, 1994b] and references therein for further discussion on 
these constructs.) 

Let us denote by ACCQO reg and ACCXO reg the description logics result- 
ing from adding nominals to ACCQ reg and ACCX reg respectively. De Gia- 
como and Lenzerini [1994a] and De Giacomo [1995] polynomially reduce sat- 
isfiability in ACCQO reg and ACCXO reg knowledge bases to satisfiability of 
ACC Q reg and ACCX reg concepts respectively, hence showing decidability and 
ExpTiME-completeness of reasoning in these logics. ExpTlME-completeness 
does not hold for ACCQXO reg , i.e., ACCQIreg extended with nominals. In- 
deed, a result by Tobies [1999a; 1999b] shows that reasoning in such a logic 
is NExpTiME-hard. Its decidability remains an open problem. 

10 Actually, nominals and the one-of construct are essentially equivalent, since a name A a is 

equivalent to {a} and {ai, . . . , a n } is equivalent to A ai U • • • U A an . 

11 The “fills” construct R: a is captured by 3R.A a . 
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The notion of nominal introduced above has a correspondent in modal 
logic [Prior, 1967; Bull, 1970; Blackburn and Spaan, 1993; Gargov and 
Goranko, 1993; Blackburn, 1993]. Nominals have also been studied within 
the setting of PDLs [Passy and Tinchev, 1985; Gargov and Passy, 1988; 
Passy and Tinchev, 1991]. The results for ACCQO reg and ACC10 reg are 
immediately applicable also in the setting of PDLs. In particular, the PDL 
corresponding to ACCQO reg is standard PDL augmented with nominals 
and graded modalities (qualified number restrictions). It is an extension 
of deterministic combinatory PDL, dcpdl, which is essentially dpdl aug- 
mented with nominals. The decidability of dcpdl is established by Passy 
and Tinchev [1985], who also prove that satisfiability can be checked in non- 
deterministic double exponential time. This is tightened by the result above 
on ExpTiME-completeness of A£CQO reg , which says that DcPDL is in fact 
ExpTlME-complete, thus closing the previous gap between the upper bound 
and the lower bound. The PDL corresponding to ACCLO reg is converse- PDL 
augmented with nominals, which is also called converse combinatory PDL, 
Ccpdl [Passy and Tinchev, 1991]. Such a logic was not known to be decid- 
able [Passy and Tinchev, 1991]. Hence the results mentioned above allow 
us to establish the decidability of Ccpdl and to precisely characterize the 
computational complexity of satisfiability (and hence of logical implication) 
as ExpTlME-complete. 


5.6 Fixpoint constructs 

Decidable description logics equipped with explicit fixpoint constructs have 
been devised in order to model inductive and coinductive data struc- 
tures such as lists, streams, trees, etc. [De Giacomo and Lenzerini, 1994d; 
Schild, 1994; De Giacomo and Lenzerini, 1997; Calvanese et al, 1999c]. Such 
logics correspond to extensions of the propositional p,-calculus [Kozen, 1983; 
Streett and Emerson, 1989; Vardi, 1998], a variant of PDL with explicit fix- 
points that is used to express temporal properties of reactive and concurrent 
processes [Stirling, 1996; Emerson, 1996]. Such logics can also be viewed as 
a well-behaved fragment of first-order logic with fixpoints [Park, 1970; 1976; 
Abiteboul et al., 1995]. 

Here, we concentrate on the description logic pACCQL studied by Cal- 
vanese et al. [1999c]. Such a description logic is derived from ACCQL by 
adding least and greatest fixpoint constructs. The availability of explicit fix- 
point constructs allows inductive and coinductive concepts to be expressed 
in a natural way. 
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Example 5.19 Consider the concept Tree, representing trees, inductively 
defined as follows: 

1. An individual that is an EmptyTree is a Tree. 

2. If an individual is a Node, has at most one parent, has some children, and all 
children are Trees, then such an individual is a Tree. 

In other words, Tree is the concept with the smallest extension among those 
satisfying the assertions 1 and 2. Such a concept is naturally expressed in 
li ACCQfL by making use of the least fixpoint construct pX.C: 

Tree = /CT.(EmptyTree U (Node n ^ 1 child n dchild.T n Vchild.X)). 


Example 5.20 Consider the well-known linear data structure, called 
stream. Streams are similar to lists except that, while lists can be considered 
as finite sequences of nodes, streams are infinite sequences of nodes. Such a 
data structure is captured by the concept Stream, coinductively defined as 
follows: 

1. An individual that is a Stream, is a Node and has a single successor which is a 

Stream. 

In other words, Stream is the concept with the largest extension among those 
satisfying condition 1. Such a concept is naturally expressed in pACCQfL by 
making use of the greatest fixpoint construct vX.C: 

Stream = i/X.(Node n ^ 1 succ n Bsucc.X). 


Let us now introduce pACCQI formally. We make use of the standard 
first-order notions of scope, bound and free occurrences of variables, closed 
formulae, etc., treating // and v as quantifiers. 

The primitive symbols in pACCQl are atomic concepts, (concept) vari- 
ables, and atomic roles. Concepts and roles are formed according to the 
following syntax: 

^ nR.C | fiX.C | X 

P an atomic role, C an arbitrary 
pACC QI concept, R an arbitrary pACCQI role (i.e., either an atomic role 
or the inverse of an atomic role), n a natural number, and X a variable. 


C — | -i C | Ci n C 2 
R — ► P | P - 

where A denotes an atomic concept, 
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The concept C in jiX.C must be syntactically monotone , that is, every free 
occurrence of the variable X in C must be in the scope of an even number 
of negations [Kozen, 1983]. This restriction guarantees that the concept C 
denotes a monotonic operator and hence both the least and the greatest 
fixpoints exist and are unique (see later). 

In addition to the usual abbreviations used in ACCQL , we introduce the 
abbreviation vX.C for —^p,X.—^C[X/—^X\, where C[X/~ <X] is the concept 
obtained by replacing all free occurrences of X by —>X. 

The presence of free variables prevents us from extending the interpreta- 
tion function - 1 directly to every concept of the logic. For this reason we 
introduce valuations. A valuation p on an interpretation X is a mapping 
from variables to subsets of A 1 . Given a valuation p, we denote by p[X/£] 
the valuation identical to p except that p[X/£\(X) = £. 

Let T be an interpretation and p a valuation on I. We assign meaning 
to concepts of the logic by associating to 1 and p an extension function - x , 
mapping concepts to subsets of A 1 , as follows: 


X P = 

p(X) C A 1 

K = 

A 1 C A 1 

(- cf P = 

A X \C x p 

(Cinc 2 f p = 

(Cfi) x p n(Cfi) x p 

finR.Cp = 

{s £ A x | {s' | ( s , s') £ R 1 and s' £ C p } > n } 

(pX.Cf p = 

£ A C? W £] c £}. 

Observe that C p i x /£] can be seen as an operator from subsets £ of A 1 to 


subsets of A x , and that, by the syntactic restriction enforced on variables, 
such an operator is guaranteed to be monotonic w.r.t. set inclusion. pX.C 
denotes the least fixpoint of the operator. Observe also that the semantics 
assigned to vX.C is 

Ux.cf^lJiecApsc cJ W£| }. 

Hence vX.C denotes the greatest fixpoint of the operator. 

In fact, we are interested in closed concepts, whose extension is indepen- 
dent of the valuation. For closed concepts we do not need to consider the 
valuation explicitly, and hence the notions of concept satisfiability, logical 
implication, etc. extend straightforwardly. 

Exploiting a recent result on ExpTime decidability of modal ^-calculus 
with converse [Vardi, 1998], and exploiting a reduction technique for 
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qualified number restrictions similar to the one presented in Section 5.4, 
Calvanese et al. [1999c] have shown that the same complexity bound holds 
also for reasoning in fiACC QZ- 

Theorem 5.21 Concept satisfiability (and hence logical implication) in 
pACCQJ is ExpTime -complete. 

For certain applications, variants of fiACCQZ that allow for mutual fix- 
points, denoting least and greatest solutions of mutually recursive equations, 
are of interest [Schild, 1994; Calvanese et al., 1998c; 1999b]. Mutual fixpoints 
can be re-expressed by suitably nesting the kind of fixpoints considered 
here (see, for example, [de Bakker, 1980; Schild, 1994]). It is interesting to 
notice that, although the resulting concept may be exponentially large in 
the size of the original concept with mutual fixpoints, the number of (dis- 
tinct) subconcepts of the resulting concept is polynomially bounded by the 
size of the original one. By virtue of this observation, and using the rea- 
soning procedure of Calvanese et al. [1999c], we can strengthen the above 
result. 


Theorem 5.22 Checking satisfiability of a closed fiACCQZ concept C can 
be done in deterministic exponential time w.r.t. the number of (distinct) 
subconcepts of C. 


Although pACCQl does not have the rich variety of role constructs of 
ACCQZreg, it is actually an extension of ACC QT reg , since any ACCQZ reg 
concept can be expressed in jiACC QI using the fixpoint constructs in a suit- 
able way. To express concepts involving complex role expressions, it suffices 
to resort to the following equivalences: 


3(i?,i o R 2 ).C 
3(R 1 UR 2 ).C 
3R*.C 
3id(D).C 


3R\.3R 2 .C 
3R\.C U 3R 2 .C 
HX.(CU3R.X) 
CnD. 


Note that, according to such equivalences, we have also that 


\/R*.C = uX.(Cn\/R.X). 

Calvanese et al. [1995] advocate a further construct corresponding to an 
implicit form of fixpoint, the so called well-founded concept construct wf 
( R ). Such a construct is used to impose well-foundedness of chains of roles, 
and thus allows one to correctly capture inductive structures. Using explicit 
fixpoints, wf (R) is expressed as fiX.(\/R.X). 
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We remark that, in order to gain the ability to express inductively and 
coinductively defined concepts, it has been proposed to adopt ad hoc se- 
mantics for interpreting knowledge bases, specifically the least fixpoint se- 
mantics for expressing inductive concepts and the greatest fixpoint seman- 
tics for expressing coinductive ones (see Chapter 2 and also [Nebel, 1991; 
Baader, 1990a; 1991; Dionne et al., 1992; Kiisters, 1998; Buchheit et al, 
1998]). Logics equipped with fixpoint constructs allow statements inter- 
preted according to the least and greatest fixpoint semantics to be mixed in 
the same knowledge base [Schild, 1994; De Giacomo and Lenzerini, 1997], 
and thus can be viewed as a generalization of these approaches. 

Recently, using techniques based on alternating two-way automata, it has 
been shown that the propositional /r-calculus with converse programs re- 
mains ExpTiME-decidable when extended with nominals [Sattler and Vardi, 
2001]. This logic corresponds to a description logic which could be called 
pALClO. 


5.7 Relations of arbitrary arity 

A limitation of traditional description logics is that only binary relationships 
between instances of concepts can be represented, while in some real world 
situations it is required to model relationships among more than two objects. 
Such relationships can be captured by making use of relations of arbitrary 
arity instead of (binary) roles. Various extensions of description logics with 
relations of arbitrary arity have been proposed [Schmolze, 1989; Catarci and 
Lenzerini, 1993; De Giacomo and Lenzerini, 1994c; Calvanese et al, 1997; 
1998a; Lutz et al, 1999]. 

We concentrate on the description logic VCR [Calvanese et al., 1997; 
1998a], which represents a natural generalization of traditional description 
logics towards n - ary relations. The basic elements of VCR are atomic re- 
lations and atomic concepts, denoted by P and A respectively. Arbitrary 
relations, of given arity between 2 and n max , and arbitrary concepts are 
formed according to the following syntax: 

R — ► T n | P | ( %i/n:C ) | -R | Ri n R 2 
C — > Ti | A | -.C | Ci n C 2 | 3[$i]R | ^ k [$i]R 

where i denotes a component of a relation, i.e. , an integer between 1 and 
nmax-, n denotes the arity of a relation, i.e., an integer between 2 and n max , 
and k denotes a nonnegative integer. Concepts and relations must be well- 
typed, which means that only relations of the same arity n can be combined 
to form expressions of type Ri n R 2 (which inherit the arity n) , and i < n 
whenever i denotes a component of a relation of arity n. 
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The semantics of VCR is specified through the usual notion of inter- 
pretation X = (A J , x ), where the interpretation function - 1 assigns to each 
concept C a subset C x of A x , and to each relation R of arity n a subset R x 
of (A T ) n , such that the following conditions are satisfied: 


' n — 
pl c 

hRf = 
(Ri n r 2 ) j = 
(Si/n-.C) 1 = 


( A X)n 

T^\R X 

Rf nRf 

m, . . . , d n ) £ T x | di £ C 1 } 


Tf = 
A 1 c 

hcf) x = 
(c 1 nc 2 ) x = 

(3[$i]R) x = 
(0[$i]R) x = 


A 1 

A 1 

A I \C I 

cf n cl 

{d £ A 1 | 3(di, . . . , dn) £ R x . di = d} 

{d £ A x | | {(di, ...,d n )eK x \di = d\}<k} 


where P, R, Ri, and R 2 have arity n. Observe that T 1 denotes the interpre- 
tation domain, while T n , for n > 1, does not denote the ?r-cartesian product 
of the domain, but only a subset of it, that covers all relations of arity n that 
are introduced. As a consequence, the “- 1 ” construct on relations expresses 
difference of relations rather than complement. 

The construct {%i/n:C) denotes all tuples in T n that have an instance 
of concept C as their ith component, and therefore represents a kind of 
selection. Existential quantification and number restrictions on relations are 
a natural generalization of the corresponding constructs using roles. This 
can be seen by observing that, while for roles the “direction of traversal” is 
implicit, for a relation one needs to explicitly say which component is used 
to “enter” a tuple and which component is used to “exit” it. 

VCR, is in fact a proper generalization of ACCQfX . The traditional DL 
constructs can be re-expressed in VCR. as follows: 


3P.C 

as 

3[$1](P n ($2/2: C)) 

3P-.C 

as 

3[$2](P n ($1/2: C)) 

yp.c 

as 

-a[$l](Pn ($2/2: -.C)) 

vp-.c 

as 

-d[$2](Pn ($1/2: -.C)) 

^ kP.C 

as 

^ k [$1](P n ($2/2: C)) 

^ kP-.C 

as 

^ k [$2](P n ($1/2: C)). 


Observe that the constructs using direct and inverse roles are represented 
in VCR by using binary relations and explicitly specifying the direction of 
traversal. 
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A TBox in VCR is a finite set of inclusion axioms on both concepts and 
relations of the form 

C E C' R C R' 

where R and R' are two relations of the same arity. The notions of an 
interpretation satisfying an assertion, and of model of a TBox are defined 
as usual. 

The basic technique used in VCR to reason on relations is reification 
(see Subsection 5.4.1), which allows one to reduce logical implication in 
VCR to logical implication in ACCQfL . Reification for n-ary relations is 
similar to reification of roles (see Definition 5.11): a relation of arity n is 
reified by means of a new concept and n functional roles fi,..,, f n . Let 
the ACCQfL TBox T be the reified counterpart of a VCR TBox T . A 
tuple of a relation R in a model of T is represented in a model of T' by 
an instance of the concept corresponding to R, which is linked through 
/i , . . . , f n respectively to n individuals representing the components of the 
tuple. In this case reification is further used to encode Boolean constructs 
on relations into the corresponding constructs on the concepts representing 
relations. 

As for reification of roles (see Subsection 5.4.1), performing the reification 
of relations requires some care, since the semantics of a relation rules out that 
there may be two identical tuples in its extension, i.e. , two tuples constituted 
by the same components in the same positions. In the reified counterpart, on 
the other hand, one cannot explicitly rule out (e.g., by using specific axioms) 
the existence of two individuals o\ and 02 “representing” the same tuple, i.e., 
that are connected through fi, ■ ■ ■ , f n to exactly the same individuals de- 
noting the components of the tuple. A model of the reified counterpart T' 
of T in which this situation occurs may not correspond directly to a model 
of T, since by collapsing the two equivalent individuals into a tuple, axioms 
may be violated (e.g., cardinality constraints). However, also in this case 
the analog of Theorem 5.12 holds, ensuring that from any model of T' one 
can construct a new one in which no two individuals represent the same 
tuple. Therefore one does not need to take this constraint explicitly into 
account when reasoning on the reified counterpart of a knowledge base with 
relations. Since reification is polynomial, from ExpTiME-decidability of log- 
ical implication in ACCQfL (and ExpTlME-hardness of logical implication in 
ACC) we get the following characterization of the computational complexity 
of reasoning in VCR [Calvanese et al, 1997]. 


Theorem 5.23 Logical implication in VCR is ExpTime- complete. 
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VCR can be extended to include regular expressions built over projec- 
tions of relations on two of their components, thus obtaining VCR reg . Such 
a logic, which represents a generalization of ACCQl reg , allows the inter- 
nalization of a TBox. ExpTiME-decidability (and hence completeness) of 
VCR reg can again be shown by exploiting reification of relations and re- 
ducing logical implication to concept satisfiability in ACC QL reg [Calvanese 
et al, 1998a]. Recently, VCR reg has been extended to VCR ]J: , which includes 
explicit fixpoint constructs on concepts, like those introduced in Section 5.6. 
The ExpTiME-decidability result extends to VCR M as well [Calvanese et al, 
1999c]. 

Recently it has been observed that guarded fragments of first-order logic 
[Andreka et al, 1996; Gradel, 1999] (see Subsection 4.2.1), which include 
n-ary relations, share with description logics the “locality” of quantifica- 
tion. This makes them of interest as extensions of description logics with 
n-ary relations [Gradel, 1998; Lutz et al, 1999]. Such description logics 
are incomparable in expressive power with VCR and its extensions: On 
the one hand the description logics corresponding to guarded fragments 
allow one to refer, by the use of explicit variables, to components of re- 
lations in a more flexible way than is possible in VCR. On the other 
hand such description logics lack number restrictions, and extending them 
with number restrictions leads to undecidability of reasoning. Also, rea- 
soning in the guarded fragments is in general NExpTlME-hard [Gradel, 
1998; 1999] and thus more difficult than in VCR and its extensions, 
although PSPACE-complete fragments have been identified [Lutz et al, 
1999]. 

5.7.1 Boolean constructs on roles and role inclusion axioms 

Observe also that VCR (and VCR reg ) allows Boolean constructs on rela- 
tions (with negation interpreted as difference) as well as relation inclusion 
axioms RCR'. In fact, VCR (resp. VCR reg ) can be viewed as a general- 
ization of ACCQJ. (resp. ACCQI reg ) extended with Boolean constructs on 
atomic and inverse atomic roles. Such extensions of ACCQI were first stud- 
ied in [De Giacomo and Lenzerini, 1994c; De Giacomo, 1995], where logical 
implication was shown to be ExpTiME-complete by a reduction to ACCQI 
(resp. ACCQlreg). The logics above do not allow atomic roles to be com- 
bined with inverse roles in Boolean combinations and role inclusion axioms. 
Tobies [2001a] shows that, for ACCQI extended with arbitrary Boolean com- 
binations of atomic and inverse atomic roles, logical implication remains in 
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ExpTime. Note that, in all logics above, negation on roles is interpreted as 
difference. For results on the impact of full negation on roles see [Lutz and 
Sattler, 2001; Tobies, 2001a]. 

Horrocks et al. [2000b] investigate reasoning in STilQ, which is ACCQI 
extended with roles that are transitive and with role inclusion axioms on 
arbitrary roles (direct, inverse, and transitive). SECIQ does not include 
reflexive-transitive closure. However, transitive roles and role inclusions al- 
low a universal role to be expressed (in a connected model), and hence allow 
TBoxes to be internalized. Satisfiability and logical implication in STiTQ 
are ExpTiME-complete [Tobies, 2001a]. The importance of STL1Q lies in 
the fact that it is the logic implemented by the current state-of-the-art DL- 
based systems (see Chapters 8 and 9). 


5.7.2 Structured objects 

An alternative way to overcome the limitations that result from the restric- 
tion to binary relationships between concepts, is to consider the interpreta- 
tion domain as being constituted by objects with a complex structure, and 
extend the description logics with constructs that allow one to specify such a 
structure [De Giacomo and Lenzerini, 1995]. This approach is in the spirit of 
object-oriented data models used in databases [Lecluse and Richard, 1989; 
Bancilhon and Khoshafian, 1989; Hull, 1988], and has the advantage, with 
respect to introducing relationships, that all aspects of the domain to be 
modeled can be represented in a uniform way, as concepts whose instances 
have certain structures. In particular, objects can either be unstructured or 
have the structure of a set or of a tuple. For objects having the structure 
of a set, a particular role allows one to refer to the members of the set, 
and similarly each component of a tuple can be referred to by means of the 
(implicitly functional) role that labels it. 

In general, reasoning over structured objects can have a very high com- 
putational complexity [Kuper and Vardi, 1993]. However, reasoning over a 
significant fragment of structuring properties can be reduced in polynomial 
time to reasoning in traditional Description Logics, again by exploiting reifi- 
cation to deal with tuples and sets. Thus, for such a fragment, reasoning can 
be done in ExpTime [De Giacomo and Lenzerini, 1995]. An important as- 
pect in exploiting Description Logics for reasoning over structured objects 
is being able to limit the depth of the structure of an object to avoid infinite 
nesting of tuples or sets. This requires the use of a well-founded construct, 
which is a restricted form of fixpoint (see Section 5.6). 
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5.8 Finite model reasoning 

For expressive description logics, in particular for those containing inverse 
roles and functionality, a TBox may admit only models with an infinite 
domain [Cosmadakis et al . , 1990; Calvanese et al, 1994]. Similarly, there may 
be TBoxes in which a certain concept can be satisfied only in an infinite 
model. This is illustrated in the following example by Calvanese [1996c]. 

Example 5.24 Consider the TBox 

FirstGuard C Guard n Vshields~.T 

Guard C EJshields n Vshields. Guard n ^ 1 shields - . 

In a model of this TBox, an instance of FirstGuard can have no shields- 
predecessor, while each instance of Guard can have at most one. Therefore, 
the existence of an instance of FirstGuard implies the existence of an infinite 
sequence of instances of Guard, each one connected through the role shields 
to the following one. This means that FirstGuard can be satisfied in an inter- 
pretation with a domain of arbitrary cardinality, but not in interpretations 
with a finite domain. ■ 

Note that the TBox above is expressed in a very simple description logic, in 
particular AC (see Chapter 2) extended with inverse roles and functionality. 

A logic is said to have the finite model property if every satisfiable formula 
of the logic admits a finite model, i.e. , a model with a finite domain. The 
example above shows that virtually all description logics including function- 
ality, inverse roles, and TBox axioms (or having the ability to internalize 
them) lack the finite model property. The example shows also that to lose 
the finite model property, functionality in only one direction is sufficient. 
In fact, it is well known that converse- dpdl, which corresponds to a frag- 
ment of ACCFlreg, lacks the finite model property [Kozen and Tiuryn, 1990; 
Vardi and Wolper, 1986]. 

For all logics that lack the finite model property, reasoning with respect 
to unrestricted and finite models are fundamentally different tasks, and this 
needs to be taken explicitly into account when devising reasoning proce- 
dures. Restricting reasoning to finite domains is not common in knowledge 
representation. However, it is typically of interest in databases, where one 
assumes that the data available are always finite [Calvanese et al., 1994; 
1999e]. 

When reasoning w.r.t. finite models, some properties that are essential 
for the techniques developed for unrestricted model reasoning in expressive 
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description logics fail. In particular, all reductions exploiting the tree model 
property (or similar properties that are based on “unraveling” structures) 
[Vardi, 1997] cannot be applied since this property does not hold when 
only finite models are considered. An intuitive justification can be given by 
observing that, whenever a (finite) model contains a cycle, the unraveling of 
such a model into a tree generates an infinite structure. Therefore alternative 
techniques have been developed. 

In this section, we study decidability and computational complexity of 
finite model reasoning over TBoxes expressed in various sublanguages of 
ACCQX . Specifically, by using techniques based on reductions to linear 
programming problems, we show that finite concept satisfiability w.r.t. 
to ACUMl TBoxes 12 constituted by inclusion axioms only is ExpTime- 
complete [Calvanese et al, 1994], and that finite model reasoning in arbi- 
trary ACCQT TBoxes can be done in deterministic double exponential time 
[Calvanese, 1996a]. 


5.8.1 Finite model reasoning using linear inequalities 

A procedure for finite model reasoning must specifically address the presence 
of number restrictions, since it is only in their presence that the finite model 
property fails. We discuss a method which is indeed based on an encoding 
of number restrictions into linear inequalities, and which generalizes the 
one developed by Lenzerini and Nobili [1990] for the Entity-Relationship 
model with disjoint classes and relationships (hence without IS-A). We first 
describe the idea underlying the reasoning technique in a simplified case. In 
the next subsection we show how to apply the technique to various expressive 
description logics [Calvanese and Lenzerini, 1994b; 1994a; Calvanese et al, 
1994; Calvanese, 1996a]. 

Consider an ACAfT TBox 13 T containing the following axioms: for each 
pair of distinct atomic concepts A and A' , an axiom A C -i A'\ and for each 
atomic role P, an axiom of the form T C MP.A 2 n VP - .Ai, for some atomic 
concepts A\ and A 2 (not necessarily distinct). Such axioms enforce that in 
all models of T the following hold: 

Pi: The atomic concepts have pairwise disjoint extensions. 

P 2 : Each role is “typed”, which means that its domain is included in the extension 
of an atomic concept A\ , and its codomain is included in the extension of an 
atomic concept A 2 . 

12 ACUMl is the description logic obtained by extending ACUM (see Chapter 2) with inverse 
roles. 

13 ACM'I is the description logic obtained by extending ACM (see Chapter 2) with inverse roles. 
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Assume further that the only additional axioms in T are used to impose 
cardinality constraints on roles and inverse roles, and are of the form 

T c ^ mi P n ^ m P 
T □ ^ 7712 P n ^ 772 P 

where m±, m, m 2 , and 77,2 are positive integers with 777.7 < ni and 7772 < ri 2 - 

Due to the fact that properties Pi and P 2 hold, the local conditions im- 
posed by number restrictions on the number of successors of each individ- 
ual are reflected into global conditions on the total number of instances of 
atomic concepts and roles. Specifically, it is not difficult to see that, for 

a model 1 of such a TBox, and for each P, A\, A 2 , 777.1, 777.2, n \ , and ri 2 

as above, the cardinalities of P x , A{, and A Lj must satisfy the following 
inequalities: 

7771 • \A\\ < I P I 2 3 \ < 77 1 • \Af | 

m 2 -\Al\ < \P I \ < n 2 -|Af|. 

On the other hand, consider the system ^ 7 - of linear inequalities contain- 
ing for each atomic role P typed by A\ and A 2 the inequalities 

7T7i • Var(Ai) < Var(P) < ni-Var(Ai) . 

7772 • Var(A 2 ) < Var(P) < ?72-Var(A2) 

where we denote by Var(A) and Var(P) the unknowns, ranging over the non- 
negative integers, corresponding to the atomic concept A and the atomic role 
P respectively. 

It can be shown that, if the only axioms in T are those mentioned above, 
then certain non-negative integer solutions of 'k-r (called acceptable solu- 
tions) can be put into correspondence with finite models of T . More pre- 
cisely, for each acceptable solution S, one can construct a model of T in 
which the cardinality of each concept or role X is equal to the value as- 
signed by S to Var(A) [Lenzerini and Nobili, 1990; Calvanese et al, 1994; 
Calvanese, 1996c]. Moreover, given ’ll 7 -, it is possible to verify, in time poly- 
nomial in its size, whether it admits an acceptable solution. 

This property can be exploited to check finite satisfiability of an atomic 
concept A w.r.t. a TBox T as follows: 

1. Construct the system T 7 - of inequalities corresponding to T. 

2. Add to fr the inequality Var(A) > 0, which enforces that the solutions cor- 
respond to models in which the cardinality of the extension of A is positive. 

3. Check whether fr admits an acceptable solution. 
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Observe that for simple TBoxes of the form described above, this method 
works in polynomial time, since (i) ^ 7 - is of size polynomial in the size of 
T . and can also be constructed in polynomial time, and (ii) checking the 
existence of acceptable solutions of ^ 7 - can be done in time polynomial in 
its size. Notice also that the applicability of the technique heavily relies on 
conditions Pi and P 2 , which ensure that, from an acceptable solution of ^ 7 -, 
a model of T can be constructed. 


5.8.2 Finite model reasoning in expressive Description Logics 

The method we have presented above is not directly applicable to more 
complex languages or TBoxes not respecting the particular form above. In 
order to extend it to more general cases we make use of the following obser- 
vation: Linear inequalities capture global constraints on the total number 
of instances of concepts and roles. So we have to represent local constraints 
expressed by number restrictions by means of global constraints. This can 
be done only if Pi and the following generalization of P 2 hold: 

V 2 '. For each atomic role P and each concept expression C appearing in T, the 
domain of P is either included in the extension of C or disjoint from it. Similarly 
for the codomain of P. 

This condition guarantees that, in a model, all instances of a concept “be- 
have” in the same way, and thus the local constraints represented by number 
restrictions are indeed correctly captured by the global constraints repre- 
sented by the system of inequalities. 

It is possible to enforce conditions Pi and P 2 for expressive description 
logics, by first transforming the TBox, and then deriving the system of in- 
equalities from the transformed version. We briefly sketch the technique to 
decide finite concept satisfiability in ACUNl TBoxes consisting of special- 
izations , i.e. , inclusion axioms in which the concept on the left-hand side is 
atomic. A detailed account of the technique and an analysis of its computa- 
tional complexity has been presented by Calvanese [1996c]. 

First of all, it is easy to see that, by introducing at most a linear number of 
new atomic concepts and TBox axioms, we can transform the TBox into an 
equivalent one in which the nesting of constructs is eliminated. Specifically, 
in such a TBox the concept on the right-hand side of an inclusion axiom 
is of the form L, L\ LI L 2 , VR.L, i^nR, or ^ nR , where L is an atomic or 
negated atomic concept. For example, given the axiom 


A C CiU C 2 
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where C\ and C2 do not have the form above, we introduce two new atomic 
concepts Ac 1 and Aq 21 and replace the axiom above by the following ones: 

A E A Cl u Ac 2 
Ac, E Ci 
Ac 2 E C 2 . 

Then, to ensure that conditions Pi and P2 are satisfied, we use instead of 
atomic concepts, sets of atomic concepts, called compound concepts 14 and 
instead of atomic roles, so called compound roles. Each compound role is a 
triple (P, Ci, C 2 ) consisting of an atomic role P and two compound concepts 
Ci and C 2 . Intuitively, the instances of a compound concept C are all those 
individuals of the domain that are instances of all concepts in C and are not 
instances of any concept not in C. A compound role ( P , Cl, C 2 ) is interpreted 
as the restriction of role P to the pairs whose first component is an instance 
of Ci and whose second component is an instance of C 2 . 

This means that two different compound concepts have necessarily disjoint 
extensions, and hence that the property corresponding to Pi holds. The 
same observation holds for two different compound roles (P, Ci,C 2 ) and 
(.P, C^jC^) that correspond to the same role P. Moreover, for compound 
roles, the property corresponding to property P 2 holds by definition, and, 
considering that the TBox contains only specializations and that nesting of 
constructs has been eliminated, P2 also holds. 

We first consider the set T of axioms in the TBox that do not involve 
number restrictions. Such axioms force certain compound concepts and com- 
pound roles to be inconsistent , i.e., have an empty extension in all interpre- 
tations that satisfy T . For example, the axiom A\ C _ 1 A 2 makes all com- 
pound concepts that contain both A\ and A 2 inconsistent. Similarly, the 
axiom A\ C VP.A 2 makes all compound roles ( P , C\, C 2 ) such that C\ con- 
tains A\ and C 2 does not contain A 2 inconsistent. Checking whether a given 
compound concept is inconsistent essentially amounts to evaluating a propo- 
sitional formula in a given propositional model (the one corresponding to 
the compound concept), and hence can be done in time polynomial in the 
size of the TBox. Similarly, one can check in time polynomial in the size of 
the TBox whether a given compound role is inconsistent. Observe, however, 
that since the total number of compound concepts and roles is exponen- 
tial in the number of atomic concepts in the TBox, doing the check for all 
compound concepts and roles takes in general exponential time. 

14 A similar technique, called atomic decomposition there, was used by Ohlbach and 

Koehler [1999]. 
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Once the consistent compound concepts and roles have been determined, 
we can introduce for each of them an unknown in the system of inequalities 
(the inconsistent compound concepts and roles are discarded). The axioms in 
the TBox involving number restrictions are taken into account by encoding 
them into suitable linear inequalities. Such inequalities are derived in a way 
similar to inequalities (5.1), except that now each inequality involves one 
unknown corresponding to a compound concept and a sum of unknowns 
corresponding to compound roles. 

Then, to check finite satisfiability of an atomic concept A, we can add to 
the system the inequality 

Y Var(C) > 1 
CC2-4 | AeC 

which forces the extension of A to be nonempty. Again, if the system admits 
an acceptable solution, then we can construct from such a solution a finite 
model of the TBox in which A is satisfied; if no such solution exists, then A is 
not finitely satisfiable. To check finite satisfiability of an arbitrary concept C, 
we can introduce a new concept name A, add to the TBox the axiom ICC, 
and then check the satisfiability of A. Indeed, if A is finitely satisfiable, then 
so is C. Conversely, if the original TBox admits a finite model X in which C 
has a nonempty extension, then we can simply extend X to A by interpreting 
A as C 1 , thus obtaining a finite model of the TBox plus the additional axiom 
in which A is satisfied. 

The system of inequalities can be effectively constructed in time expo- 
nential in the size of the TBox, and checking for the existence of acceptable 
solutions is polynomial in the size of the system [Calvanese et al, 1994; 
Calvanese, 1996a] . Moreover, since verifying concept satisfiability is already 
ExpTlME-hard for TBoxes consisting of specializations only and expressed 
in the much simpler language ACli [Calvanese, 1996b], the above method 
provides a computationally optimal reasoning procedure. 

Theorem 5.25 Finite concept satisfiability in AjCUATI TBoxes consisting 
of specializations only is ExpTime- complete. 

The method can be extended to decide finite concept satisfiability for a 
wider class of TBoxes, in which a negated atomic concept and, more gener- 
ally, an arbitrary Boolean combination of atomic concepts may appear on the 
left-hand side of axioms. In particular, this makes it possible to deal also 
with knowledge bases containing definitions of concepts that are Boolean 
combinations of atomic concepts, and to reason on such knowledge bases in 
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deterministic exponential time. Since ACUML is not closed under negation, 
we cannot immediately reduce logical implication to concept satisfiability. 
However, the technique presented above can be adapted in specific cases 
to decide also finite logical implication in deterministic exponential time 
[Calvanese, 1996c]. 

A further extension of the above method can be used to decide logical 
implication in ACCQI . The technique uses two successive transformations 
on the TBox, each of which introduces a worst-case exponential blowup, and 
a final polynomial encoding into a system of linear inequalities [Calvanese, 
1996c; 1996a]. 

Theorem 5.26 Logical implication w.r.t. finite models in ACCQL can be 
decided in worst- case deterministic double exponential time. 

For more expressive description logics, and in particular for all those de- 
scription logics containing the construct for reflexive-transitive closure of 
roles, the decidability of finite model reasoning is still an open problem. De- 
cidability of finite model reasoning for C 2 , i.e., first-order logic with two vari- 
ables and counting quantifiers (see also Chapter 4, Section 4.2) was shown 
recently [Gradel et al., 1997b]. C 2 is a logic that is strictly more expressive 
than ACCQL TBoxes, since it allows one, for example, to impose cardinality 
restrictions on concepts [Baader et al., 1996] or to use the full negation of 
a role. However, apart from decidability, no complexity bound is known for 
finite model reasoning in C 2 . 

Techniques for finite model reasoning have also been studied in databases. 
In the relational model, the interaction between inclusion dependencies 
and functional dependencies causes the loss of the finite model prop- 
erty, and finite implication of dependencies under various assumptions 
has been investigated by Cosnradakis et al. [1990] . A method for finite 
model reasoning has been presented by Calvanese and Lenzerini [1994b; 
1994a] in the context of a semantic and an object-oriented database model, 
respectively. The reasoning procedure, which represents a direct general- 
ization of the one discussed above to relations of arbitrary arity, does not 
exploit reification to handle relations (see Section 5.7) but directly encodes 
the constraints on them into a system of linear inequalities. 


5.9 Undecidability results 

Several additional DL constructs besides those discussed in the previous 
sections have been proposed in the literature. In this section we present the 
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most important of these extensions, discussing how they influence decidabil- 
ity, and what modifications to the reasoning procedures are needed to take 
them into account. In particular, we discuss Boolean constructs on roles, 
variants of role-value-maps or role agreements, and number restrictions on 
complex roles. Most of these constructs lead to undecidability of reasoning, 
if used in an unrestricted way. Roughly speaking, this is mainly due to the 
fact that the tree model property is lost [Vardi, 1997]. 


5.9.1 Boolean constructs on complex roles 

In those Description Logics that include regular expressions over roles, such 
as ACCQIreg-, since regular languages are closed under intersection and com- 
plementation, the intersection of roles and the complement of a role are al- 
ready expressible, if we consider them applied to the set of role expressions. 
Here we consider the more common approach in PDLs, namely to regard 
Boolean operators as applied to the binary relations denoted by complex 
roles. The logics thus obtained are more expressive than traditional PDL 
[Harel, 1984] and reasoning is usually harder. We notice that the semantics 
immediately implies that intersection of roles can be expressed by means of 
union and complementation. 

Satisfiability in pdl augmented with intersection of arbitrary programs 
is decidable in deterministic double exponential time [Danecki, 1984], and 
so is satisfiability in ACC reg augmented with intersection of complex roles, 
even though these logics have neither the tree model nor the finite model 
property. On the other hand, satisfiability in pdl augmented with comple- 
mentation of programs is undecidable [Harel, 1984], and so is reasoning in 
ACC reg augmented with complementation of complex roles. Also, dpdl aug- 
mented with intersection of complex roles is highly undecidable [Harel, 1985; 
1986], and since global functionality of roles (which corresponds to deter- 
minism of programs) can be expressed by means of local functionality, the 
undecidability carries over to ACC A reg augmented with intersection of roles. 

These proofs of undecidability make use of a general technique based on 
the reduction from the unbounded tiling (or domino) problem [Berger, 1966; 
Robinson, 1971], which is the problem of checking whether a quadrant of the 
integer plane can be tiled using a finite set of tile type - i.e. , square tiles with 
a color on each side - in such a way that adjacent tiles have the same color on 
the sides that touch. 15 We sketch the idea of the proof using the terminology 

15 In fact the reduction is from the II J -complete - and thus highly undecidable - recurring tiling 
problem [Harel, 1986], where one additionally requires that a certain tile occurs infinitely often 
on the :r-axis. 
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of description logics, instead of that of PDLs. The reduction uses two roles 
right and up which are globally functional (i.e. , ^ 1 right, ^ 1 up) and denote 
pairs of tiles that are adjacent in the x and y directions, respectively. By 
means of intersection of roles, right and up are constrained to effectively 
define a two-dimensional grid. This is achieved by imposing for each point 
of the grid (i.e., reachable through rightand up) that by following right o up 
one reaches a point reached also by following up o right: 

V(right U up)*.3((right o up) n (up o right)). 

To enforce this condition, the use of intersection of compositions of atomic 
roles is essential. Reflexive-transitive closure (i.e., V(rightU up)*. (7) is then 
also exploited to impose the required constraints on all tiles of the grid. 
Observe that, in the above reduction, one can use TBox axioms instead of 
reflexive-transitive closure to enforce the necessary conditions in every point 
of the grid. 

The question arises whether decidability can be preserved if one restricts 
Boolean operations to basic roles, i.e., atomic roles and their inverse. This is 
indeed the case if complementation of basic roles is used only to express dif- 
ference of roles, as demonstrated by the ExpTime decidability of VC1Z and 
its extensions, in which intersection and difference of relations are allowed 
(see Section 5.7). 


5.9.2 Role-value-maps 

Another construct, which stems from frame systems, and which provides 
an additional useful means to specify structural properties of concepts, is 
the so called role-value-map [Brachman and Schmolze, 1985], which comes 
in two forms: An equality role-value-map, denoted R,\ = Ra, represents the 
individuals o such that the set of individuals that are connected to o via 
role R,\ equals the set of individuals connected to o via role R 2 - The second 
form of role- value-map is containment role-value-map, denoted R\ C R 2 , 
whose semantics is defined analogously, using set inclusion instead of set 
equality. Using these constructs, one can denote, for example, by means 
of owns o madeJn C livesJn the set of all persons that own only products 
manufactured in the country they live in. 

When role-value-maps are added, the logic loses the tree model property, 
and this construct leads immediately to undecidability of reasoning when 
applied to role chains (i.e., compositions of atomic roles). For ACC reg , this 
can be shown by a reduction from the tiling problem in a similar way to that 
used in [Harel, 1985] for DPDL with intersection of roles. In this case, the 
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concept right o up = up o right involving role- value-map can be used instead 
of role intersection to define the constraints on the grid. The proof is slightly 
more involved than that for dpdl, since one needs to take into account that 
the roles right and up are not functional (while in dpdl all programs /roles 
are functional). However, undecidability holds already for concept sub- 
sumption (with respect to an empty TBox) in AC (in fact RC~) augmented 
with role- value-maps, where the roles involved are compositions of atomic 
roles [Schmidt-Schaufi, 1989] - see Chapter 3 for the details of the proof. 

As for role intersection, in order to show undecidability, it is necessary 
to apply role- value-maps to compositions of roles. Indeed, if the application 
of role- value-maps is restricted to Boolean combinations of basic roles, it 
can be added to ACCQ2 reg without influencing decidability and worst-case 
complexity of reasoning. This follows directly from the decidability results 
for the extension with Boolean constructs on atomic and inverse atomic roles 
(captured by VC1Z). Indeed, R\ C R 2 is equivalent to V(i?i n -life)--!-, and 
thus can be expressed using difference of roles. We observe also that universal 
and existential role agreements introduced in [Hanschke, 1992], which allow 
one to define concepts by posing various types of constraints that relate 
the sets of fillers of two roles, can be expressed by means of intersection 
and difference of roles. Thus reasoning in the presence of role agreements is 
decidable, provided these constructs are applied only to basic roles. 


5. 9. 3 Number restrictions on complex roles 

In ACCRlreg, the use of (qualified) number restrictions is restricted to 
atomic and inverse atomic roles, which guarantees that the logic has the 
tree model property. This property is lost, together with decidability, if func- 
tional restrictions may be imposed on arbitrary roles. The reduction to show 
undecidability is analogous to the one used for intersection of roles, except 
that now functionality of a complex role (i.e., 1 (right o up) LI (up o right)) 

is used instead of role intersection to define the grid. 

An example of decidable logic that does not have the tree model property 
is obtained by allowing the use of role composition (but not transitive 
closure) inside number restrictions. Let us denote by A f(X), where X is 
a subset of {U,n,o, - }, unqualified number restrictions on roles that are 
obtained by applying the role constructs in X to atomic roles. Let us 
denote by ACCAf(X) the description logic obtained by extending ACC (see 
Chapter 2) with number restrictions in N(X). As shown by Baader and 
Sattler [1999], concept satisfiability is decidable for the logic ACCN{o), 
even when extended with number restrictions on union and intersection 
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of role chains of the same length. Notice that decidability for ACCN(o) 
holds only for reasoning on concept expressions and is lost if one considers 
reasoning with respect to a TBox (or alternatively adds transitive closure of 
roles) [Baader and Sattler, 1999] . Reasoning even with respect to the empty 
TBox is undecidable if one adds to ACCJ\f number restrictions on more 
complex roles. In particular, this holds for ACCAT(n, o) (if no constraints on 
the lengths of the role chains are imposed) and for ACCN(U, o~ ) [Baader 
and Sattler, 1999]. The reductions again exploit the tiling problem, and 
make use of number restrictions on complex roles to simulate a universal 
role that is used for imposing local conditions on all points of the grid. 

Summing up, we can state that the borderline between decidability and 
undecidability of reasoning in the presence of number restrictions on com- 
plex roles has been traced quite precisely, although there are still some open 
problems. E.g., it is not known whether concept satisfiability in ACCM(y\ , o) 
is decidable (although logical implication is undecidable) [Baader and 
Sattler, 1999]. 
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Abstract 

This chapter considers, on the one hand, extensions of Description Logics by 
features not available in the basic framework, but considered important for 
using Description Logics as a modeling language. In particular, it addresses 
the extensions concerning: concrete domain constraints; modal, epistemic, 
and temporal operators; probabilities and fuzzy logic; and defaults. 

On the other hand, it considers non-standard inference problems for 
Description Logics, i.e., inference problems that - unlike subsumption or 
instance checking - are not available in all systems, but have turned out 
to be useful in applications. In particular, it addresses the non-standard 
inference problems: least common subsumer and most specific concept; 
unification and matching of concepts; and rewriting. 


6.1 Introduction 

Chapter 2 introduces the language ACCM as a prototypical Description 
Logic, defines the most important reasoning tasks (like subsumption, in- 
stance checking, etc.), and shows how these tasks can be realized with 
the help of tableau-based algorithms. For many applications, the expres- 
sive power of ACCM is not sufficient to express the relevant terminological 
knowledge of the application domain. Some of the most important exten- 
sions of ACCM by concept and role constructs have already been briefly 
introduced in Chapter 2; these and other extensions have then been treated 
in more detail in Chapter 5. All these extensions are “classical” in the sense 
that their semantics can easily be defined within the model-theoretic frame- 
work introduced in Chapter 2. Although combinations of these constructs 
may lead to very expressive Description Logics (the unrestricted combination 
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even to undecidable ones), all the Description Logics obtained this way can 
only be used to represent time- independent, objective, and certain knowl- 
edge. In addition, they do not allow “built-in data structures” like numerical 
domains. 

The “nonclassical” language extensions considered in the first part of 
this chapter try to overcome some of these deficiencies. The extension by 
concrete domains allows us to integrate numerical and other domains in a 
schematic way into Description Logics. The extension of Description Logics 
by modal operators allows the representation of time-dependent and sub- 
jective knowledge (e.g., knowledge about knowledge and belief of intelligent 
agents) . Description Logics that can explicitly represent time have also been 
introduced outside the modal framework. The extension by epistemic opera- 
tors provides a model-theoretic semantics for rules; it can be used to impose 
“local” closed world assumptions, and to integrate integrity constraints into 
Description Logics. In order to represent vague and uncertain knowledge , dif- 
ferent approaches based on probabilistic, possibilistic, and fuzzy logics have 
been proposed. Finally, non-monotonic Description Logics are obtained by 
the integration of defaults into Description Logics. 

When building and maintaining large DL knowledge bases, inference 
services like subsumption and satisfiability are very helpful, but in general 
not quite sufficient for an adequate support of the knowledge engineer. 
For this reason, some DL systems (e.g., Classic) provide their users 
with additional system services, which can formally be reconstructed as 
new types of inference problems. In the second part of this chapter we 
will motivate and introduce the most prominent of these “non-standard” 
inference problems, and try to give an intuition on how they can be solved. 


6.2 Language extensions 

The extensions introduced in this section are “nonclassical” in the sense 
that defining their semantics is not obvious and requires an extension of 
the model-theoretic framework considered until now; for many (but not all) 
of these extensions, nonclassical logics (such as modal and non-monotonic 
logics) are employed to provide the right framework. 


6.2.1 Concrete domains 

A drawback that all Description Logics introduced until now share is that 
all the knowledge must be represented on the abstract logical level. In many 
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applications, one would like to be able to refer to concrete domains and 
predefined predicates on these domains when defining concepts. An example 
of such a concrete domain could be the set of nonnegative integers, with 
predicates such as > (greater or equal) or < (less than). For example, assume 
that we want to give an adequate definition of the concept Woman. The 
first idea could be to use the concept description Human n Female for this 
purpose. However, a newborn female baby would probably not be called a 
woman, and neither would a three-year-old toddler. Thus, as an additional 
property, one could require that a female human being should be old enough 
(e.g., at least 18) to be called a woman. In order to express this property, one 
would like to introduce a new (functional) role has-age, and define Woman by 
an expression of the form Human n Female n 3has-age.>is. Here >is stands 
for the unary predicate {n \ n > 18} of all nonnegative integers greater than 
or equal to 18. 

Stating such properties directly with reference to a given numerical do- 
main seems to be easier and more natural than encoding them somehow into 
abstract concept expressions. In addition, such a direct representation makes 
it possible to use existing reasoners for the concrete domain. For example, 
we could have also decided to introduce a new atomic concept At Least 18 to 
express the property of being at least 18 years old. However, if for some rea- 
son we also need the property of being at least 21 years old, we must make 
sure that the appropriate subsumption relationship between AtLeastl8 and 
AtLeast21 is asserted as well. While this could still be done by adding ap- 
propriate inclusion axioms, it does not appear to be an elegant solution, 
and it would still not take care of other relationships, e.g., the fact that 
AtLeastl8 n AtMostl6 is unsatisfiable. In contrast, an appropriate reasoner 
for intervals of nonnegative integers would automatically take care of these 
relationships. 

The need for such a language extension was already evident to the de- 
signers of early DL systems such as Meson [Edelmann and Owsnicki, 1986; 
Patel-Schneider et at, 1990], Iv-Rep [Mays et al., 1988; 1991a], andCLASSlC 
[Brachman et al, 1991; Borgida and Patel-Schneider, 1994]: in addition to 
abstract individuals, these systems also allow one to refer to “concrete” in- 
dividuals such as numbers and strings. Both the Classic and the K-Rep 
reasoner can deal correctly with intervals, whereas in Meson the user had 
to supply the adequate relationships between the concrete predicates in a 
separate hierarchy. All these approaches are, however, ad hoc in the sense 
that they are restricted to a specific collection of concrete objects. 

In contrast, Baader and Hanschke [1991a] propose a scheme for integrating 
(almost) arbitrary concrete domains into Description Logics. This extension 
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was designed such that: 

• it still has a formal declarative semantics that is very close to the usual semantics 
employed for Description Logics; 

• it is possible to combine the tableau-based algorithms available for Description 
Logics with existing reasoning algorithms in the concrete domain in order to 
obtain the appropriate algorithms for the extension; 

• it provides a scheme for extending Description Logics by various concrete domains 
rather than constructing a single ad hoc extension for a specific concrete domain. 

In the following, we will first introduce the original proposal by Baader and 
Hanschke, and then describe two extensions of this proposal [Hanschke, 1992; 
Haarslev et al . , 1999]. 

6.2. 1.1 The family of Description Logics ACC(fD) 

Before we can define the members of this family of Description Logics, we 
must formalize the notion of a concrete domain. 

Definition 6.1 A concrete domain V consists of a set A®, the domain of 
V, and a set pred(V), the predicate names of V. Each predicate name P G 
pred(D) is associated with an arity n, and an n-ary predicate P v C (A v ) n . 

Let us illustrate this definition by examples of interesting concrete domains. 
Let us start with some numerical ones: 

• The concrete domain A/”, which we have employed in our introductory example, 
has the set N of all nonnegative integers as its domain, and pred(Af) consists of 
the binary predicate names <, <, >, > and the unary predicate names <„, <„, 
>„, > n for n G N, which are interpreted by predicates on N in the obvious way. 

• The concrete domain 1Z has the set R. of all real numbers as its domain, and the 
predicates of 1Z are given by formulae that are built by first-order means (i.e., 
by using Boolean connectives and quantifiers) from equalities and inequalities 
between integer polynomials in several indeterminates. For example, x + z 2 = y 
is an equality between the polynomials p(x, z) = x + z 2 and q{y) = y\ and x > 
y is an inequality between very simple polynomials. From these equalities and 
inequalities one can for instance build the formulae 3z.(x + z 2 = y) and 3 z.(x + 
z 2 = y) V (x > y) . The first formula yields a predicate name of arity 2 (since it has 
two free variables), and it is easy to see that the associated predicate is {(r, s) | 
r and s are real numbers and r < s}. Consequently, the predicate associated to 
the second formula is {(r, s) | r and s are real numbers} = RxK. 

• The concrete domain Z is defined just like 1Z , with the only difference that A z 
is the set of all integers instead of all real numbers. 
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In addition to numerical domains, Definition 6.1 also captures more abstract 
domains: 

• A given (fixed) relational database DB can be seen as a concrete domain VB , 
whose domain is the set of atomic values occurring in DB, and whose predicates 
are the relations that can be defined over DB using a query language (such as 
SQL). 

• One can also consider Allen’s interval calculus [Allen, 1983] as concrete domain 
IC. Here A IC consists of time intervals, and the predicates are built from Allen’s 
basic interval relations (such as before, after, . . . ) with the help of Boolean con- 
nectives. 

• Instead of time intervals one can also consider spatial regions (e.g., in RxK), 
and use as predicates Boolean combinations of the basic relations of the Region 
Connection Calculus RCC-8 [Randell et al. , 1992; Bennett, 1997]. 

Although the syntax and semantics of Description Logics extended by 
concrete domains could be defined with the general notion of a concrete 
domain introduced in Definition 6.1, the requirement that the extended lan- 
guage should still have decidable reasoning problems adds some additional 
restrictions. 

To be able to compute the negation normal form of concepts in the ex- 
tended language, we must require that the set of predicate names of the 
concrete domain is closed under negation , i.e., if P is an n-ary predicate 
name in predfiD ) then there has to exist a predicate name Q in predfiD) such 
that Q d = (A v ) n \ P v . We will refer to this predicate name by P. In addi- 
tion, we need a unary predicate name that denotes the predicate A v . The 
domain M from above satisfies these two properties since, e.g., < n = > n and 
(>o)^ = N. 

Let us now clarify what kind of reasoning mechanisms are required in the 
concrete domain. Let P\ , . . . , P^ be k (not necessarily different) predicate 
names in pred(T > ) of arities n i, . . . , nj~. We consider the conjunction 

k 

1=1 

Here a stands for an u-j-tuple (x± \ . . . ,r[^) of variables. It is important 
to note that neither all variables in one tuple nor those in different tuples 
are assumed to be distinct. Such a conjunction is said to be satisfiable iff 
there exists an assignment of elements of A v to the variables such that 
the conjunction becomes true in V. We will call the problem of deciding 
satisfiability of finite conjunctions of this form the satisfiability problem for 
V. 
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Definition 6.2 The concrete domain V is called admissible iff (i) the set 
of its predicate names is closed under negation and contains a name T x> for 
A v , and (ii) the satisfiability problem for V is decidable. 

With the exception of Z, all the concrete domains introduced above are 
admissible. For example, decidability of the satisfiability problem for 1Z is a 
consequence of Tarski’s decidability result for real arithmetic [Tarski, 1951; 
Collins, 1975]. In contrast, the undecidability of the satisfiability problem for 
Z is a consequence of the undecidability of Hilbert’s 10th problem [Matiya- 
sevich, 1971; Davis, 1973]. 

In the following, we will take the language ACC as the (prototypical) 
starting point of our extension. 1 In the following, let T> be an arbitrary (but 
fixed) concrete domain. The interface between ACC and the concrete domain 
is inspired by the agreement construct between chains of functional roles 
(see Chapter 2, Subsection 2.4.3). With this construct one can, for example, 
express the concept of all women whose father and husband are of the same 
age by the expression Woman n has-father o has-age = has-husband o has-age. 
However, one cannot express that the husband is even older than the father. 
This becomes possible if we take the concrete domain J\f . Then we can 
simply write 

Woman n 3(has-father o has-age, has-husband o has-age). <. 

More generally, our extension, called ACC(T>), will allow us to state that 
a tuple of chains of functional roles satisfies a (not necessarily binary) 
predicate, which is provided by the concrete domain in question. 

Thus, ACC(V) extends ACC in two respects. First, the set of role names 
is now assumed to be partitioned into a set of functional roles and a set 
of ordinary roles. Both types of roles are allowed to occur in value restric- 
tions and in the existential quantification construct. In addition, there is a 
new constructor, called existential predicate restriction , which is defined by 
adding to the syntax rules for ACC the rule 

C,D — * 3(ui ,...,u n ).P, 

where P is an n-ary predicate of T> and u i ,.. . , u n are chains of functional 
roles. When considering „4£C(D)-ABoxes, one must distinguish between 
names for abstract and for concrete individuals. Concrete predicates P 6 

1 All the definitions would, of course, also work for any other concept description language. The 
approach for combining the reasoning algorithms will work for many other languages, but not 
for all of them. 



Extensions to Description Logics 


243 


pred(V) give rise to additional ABox assertions of the form P{x i, . . . ,x n ), 
where xi, . . . ,x n are names for concrete individuals. 

Definition 6.3 An interpretation T for ACC(T>) consists of a set A 2 , the 
abstract domain of the interpretation, and an interpretation function. The 
abstract domain and the given concrete domain must be disjoint, i.e., A v fi 
A 2 = 0. As before, the interpretation function associates with each concept 
name a subset of A 2 and with each ordinary role name a binary relation 
on A 2 . The new feature is that the functional roles are now interpreted by 
partial functions from A 2 into A 2 U A v . If u = f\ o • • • o f n is a chain of 
functional roles, then u x denotes the composition / 2 o • • • o / 2 of the partial 
functions /f , . . . , / 2 . 

The semantics of the usual A£C-constructors is defined as before. In par- 
ticular, this means that complex concept descriptions are always interpreted 
as subsets of the abstract domain A 2 . The existential predicate restriction 
is interpreted as follows: 

(=](«!, . . . , u n ).PY = {i£ A 1 | there exist rq, . . . , r n € A D such that 

Ui(x) = ri, . . . , u T n (x) = r n and (r i, . . . , r n ) G P v }. 

Above, we have already seen two examples of concepts of ACC (AT). The 
following A£C(7£)-concepts describe rectangles and squares in 1 x 1: 

Rectangle = 3(x, y, b, h).rectangle-cond, 

Square = Rectangle n 3(b, h). equal, 

where the concrete predicates rectangle-cond and equal are defined 
as equal (x,y)<^x = y and recta ngle-cond(x, y, b, h) b > 0 A h > 0. In 
rectangle-cond, the first two arguments are assumed to express the x and 
y coordinates of the lower left corner of the rectangle, while the third and 
fourth argument express the breadth and height of the rectangle. We leave 
it to the reader to define the concept “pairs of rectangles” where the first 
component is a square that is contained in the second component. 

A tableau-based algorithm for deciding consistency of ACC(V)-ABoxes 
for admissible T> was introduced in [Baader and Hanschke, 1991b]. The al- 
gorithm has an additional rule that treats existential predicate restrictions 
according to their semantics. The main new feature is that, in addition to 
the usual “abstract” clashes, there may be concrete ones, i.e., one must 
test whether the given combination of concrete predicate assertions is non- 
contradictory. This is the reason why we must require that the satisfia- 
bility problem for T> is decidable. As described in [Baader and Hanschke, 
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1991b], the algorithm is not in PSpace. Using techniques similar to the 
ones employed for ACC it can be shown, however, that the algorithm can 
be modified such that it needs only polynomial space [Lutz, 1999b], pro- 
vided that the satisfiability procedure for V is in PSpace. In the presence 
of acyclic TBoxes, reasoning in ACC(V) may become NExpTlME-hard even 
for rather simple concrete domains with a polynomial satisfiability problem 
[Lutz, 2001b]. 

This technique of combining a tableau-based algorithm for the Descrip- 
tion Logics with a satisfiability procedure for the concrete domain can be 
extended to more expressive Description Logics (e.g., ACCM and ACCM 
with agreements and disagreements). However, this is not true for arbitrary 
Description Logics with tableau-based decision procedures. For example, the 
technique does not work if the tableau-based algorithm requires some sort of 
blocking (see Chapter 2, Subsection 2. 3. 2. 4) to ensure termination. Techni- 
cally, the problem is that concrete predicates can be used to state properties 
concerning different individuals in the ABox, and that blocking, which is con- 
cerned only with the properties of a single individual, cannot take this into 
account. The main idea underlying an undecidability proof for such a logic is 
that elements of the concrete domain (e.g., 7 Z) can encode configurations of 
a Turing machine and that one can define a concrete predicate stating that 
one configuration is a direct successor of the other. Finally, the Description 
Logic must provide some means of representing sequences of configurations 
of arbitrary length, which is usually the case for Description Logics requiring 
blocking. More concretely, it was shown in [Baader and Hanschke, 1992] (by 
reduction from Post’s correspondence problem) that satisfiability of concepts 
becomes undecidable if transitive closure (of a single functional role) is added 
to ACC(1Z ). Post’s correspondence problem can also be used to show un- 
decidability of ACC(1Z) with general inclusion axioms, although one cannot 
use exactly the same reduction as for transitive closure (see [Haarslev et al . , 
1998] for a similar reduction). A notable exception to the rule of thumb 
that concrete domains together with general inclusion axioms lead to un- 
decidability has recently been shown by Lutz [2001a], who combines ACC 
with the concrete domain of rational numbers with equality and inequality 
predicates. 


6.2. 1.2 Predicate restrictions on role chains 

The role chains occurring in predicate restrictions of ACC(V) are restricted 
to chains of functional roles. In [Hanschke, 1992] this restriction was re- 
moved. To be more precise, the syntax rules for ACC are extended by the 
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two rules 

C, D > 3(ui , . . . , U,f) .P | V(tti , . . . , M n ) -P, 

where P is an n-ary predicate of T> and u \, . . . , u n are chains of (not neces- 
sarily functional) roles. 

In this setting, ordinary roles are also allowed to have fillers in the concrete 
domain, i.e. , both functional and ordinary roles are interpreted as subsets 
of A x x (A 1 U A v ). Of course, functional roles must still be be interpreted 
as partial functions. The extension of the predicate restrictions is defined as 

(3(«i, . . . , u n ).P) x = {x € A 1 | there exist rq, . . . , r n € A v such that 

(x, ri) G uf , . . . , (x,r n ) G uf and (rq, . . . ,r„) G P v }, 

(V(ui, . . . ,u n ).P) x = {x G A 1 | for all rq, . . . ,r n : (x,ri) € uf,..., ( x,r n ) G uf 

implies (n, . . . , r„) G P v }. 

Using the universal predicate restriction one can, for example, define the 
concept of parents all of whose children are younger than 4 by the description 

Parent n Vhas-child o has-age. <4 . 

Hanschke [1992] shows that an extension of the Description Logic we have 
just introduced still has a decidable ABox consistency problem, provided 
that the concrete domain V is admissible. 

6.2. 1.3 Predicate restrictions defining roles 

In [Haarslev et al, 1998; 1999], ALC(V) was extended in a different direction: 
predicate restrictions can now also be used to define new roles. To be more 
precise, if P is a predicate of V of arity n + m , and u\, ... ,u n and v\, ... ,v m 
are chains of functional roles, then 

3(m, . . .,u n )(v 1 , . . .,v m ).P 

is a complex role. These complex roles may be used both in value restrictions 
and in the existential quantification construct. The semantics of complex 
roles is defined as 
(3(ui, ...,u n )(v 1 , . . . ,v m ).P) x = 

{(a;, y) G A x x A 1 \ there exist n, . . . , r n , si, , . . , s m G A® such that 

uf{x) =n ,...,uf(x) =r n ,vf(y) = s 1 , . = s m 

and (n,...,r„,Si,...,s TO ) G P v }. 

For example, the complex role 3(has-age) (has-age). > consists of all pairs 
of individuals having an age such that the first is older than the second. 

Unfortunately, it has turned out that the full logic obtained by this ex- 
tension has an undecidable satisfiability problem [Haarslev et al., 1998]. To 
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overcome this problem, Haarslev et al. [1999] define syntactic restrictions 
on concepts such that the restricted language (i) is closed under negation, 
and (ii) has a decidable ABox consistency problem. Consequently, the sub- 
sumption and the instance problem are also decidable. The complexity of 
reasoning in this Description Logic is investigated in [Lutz, 2001b]. As in the 
case of acyclic TBoxes, rather simple concrete domains can already make 
reasoning NExpTlME-hard. 

An approach for integrating arithmetic reasoning into Description Logics 
that considerably differs from the concrete domain approach described above 
was proposed by Ohlbach and Koehler [1999]. 


6.2.2 Modal extensions 

Although the Description Logics discussed so far provide a wide choice 
of constructors, usually they are intended to represent only static knowl- 
edge and are not able to express various dynamic aspects such as time- 
dependence, beliefs of different agents, obligations, etc. For example, in ev- 
ery standard description language we can define a concept “good car” as, 
say, a car with an air-conditioner: 

GoodCar = Car n dpart.Airconditioner. (6.1) 

However, we have no means to represent the subtler knowledge that only 
John believes (6.1) to be the case, while Mary does not think so: 

[John believes] (6.1) A -i [Mary believes] (6.1). 

Nor can we express the fact that (6.1) holds now, but in the future the 
notion of a good car may change (since, for instance, all cars will have air 
conditioners) : 


(6.1) A ( eventually ) -i(6.1). 

A way to bridge this gap seems quite clear and will be discussed in this and 
the next section: one can simply combine a Description Logic with a suitable 
modal language treating belief, temporal, deontic or some other intensional 
operators. However, there are a number of parameters that determine the 
design of a modal extension of a given Description Logic. 

(I) First, modal operators can be applied to different kinds of well-formed 
expressions of the Description Logic. 
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One may apply them only to conceptual and assertional axioms, thereby 
forming new axioms of the form 

[John 6eZiei>es] (GoodCar = Car n 3part.Airconditioner), 

[Mary believes ] (eventually) (Rich(JOHN)). 

Modal operators may also be applied to concepts in order to form new ones: 

[John believes] expensive 

i.e. , the concept of all objects John believes to be expensive, or 

HumanBeing n dchild. [Mary believes ] (eventually) GoodStudent 

i.e., the concept of all human beings with a child that Mary believes will 
eventually be a good student. By allowing applications of modal operators 
to both concepts and axioms we obtain expressions of the form 

[John believes]( GoodCar = [Mary believes] GoodCar) 

i.e., John believes that a car is good if and only if Mary thinks so. 

Finally, one can supplement the options above with modal operators appli- 
cable to roles. For example, using the temporal operator [always] (in future) 
and the role loves, we can form the new role [always] loves (which is under- 
stood as a relation between objects x and y that holds if and only if x will 
always love y) to say 

(3[always] loves. Woman) (JOHN) 

i.e., John will always love the very same woman (but perhaps not only her), 
which is not the same as ([aZways]Elloves.Woman)(JOHI\l). 

(II) All these languages are interpreted with the help of the possible 
worlds semantics, in which the accessibility relations between worlds (or 
points in time, . . . ) treat the modal operators, and the worlds themselves 
are Description Logic interpretations. 

The properties of the modal operators are determined by the conditions 
we impose on the corresponding accessibility relations. For example, by im- 
posing no condition at all we obtain what is known as the minimal normal 
modal logic K although of definite theoretical interest, it does not have 
the properties required to model operators like [agent A knows], (eventually) , 
etc. In the temporal case, depending on the application domain we may as- 
sume time to be linear and discrete (for example, the usual strict ordering of 
the natural numbers), or branching, or dense, etc. (see [Gabbay et al, 1994; 
van Benthem, 1996]). Moreover, we have the possibility to work with in- 
tervals instead of points in time (see Subsection 6.2.4). In epistemic logic, 
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transitivity of the accessibility relation for agent A’s knowledge means what 
is called positive introspection (A knows what A knows), euclideanness 
corresponds to negative introspection (A knows what A does not know), 
and reflexivity means that everything known by A is true; see Subsec- 
tion 6.2.3 for a formulation of these principles in terms of Description Log- 
ics. For more information and further references consult [Fagin et al, 1995; 
Meyer and van der Hoek, 1995]. 

(III) When connecting worlds - that is, ordinary interpretations of the 
pure description language - by accessibility relations, we are faced with the 
problem of connecting their objects. Depending on the particular applica- 
tion, we may assume worlds to have arbitrary domains (the varying domain 
assumption) , or we may assume that the domain of a world accessible from 
a world w contains the domain of w (the expanding domain assumption) , or 
that all the worlds share the same domain (the constant domain assump- 
tion) I; see [van Benthem, 1996] for a discussion in the context of first-order 
temporal logic. Consider, for instance, the following axioms: 

-<[agent A knows] (Unicorn = _L) 

([agent A knows ] -i Unicorn) = T. 

The former means that agent A does not know that unicorns do not exist, 
while according to the latter, for every existing object, A knows that it is 
not a unicorn. Such a situation can be modeled under the expanding domain 
assumption, but these two formulas cannot be simultaneously satisfied in a 
model with constant domains. 

(IV) Finally, one should take into account the difference between global 
(or rigid) and local (or flexible) symbols. In our context, the former are the 
symbols which have the same extension in every world in the model under 
consideration, while the latter are those whose interpretation is not fixed. 
Again the choice between these depends on the application domain: if the 
knowledge base is talking about employees of a company then the name John 
Smith should probably denote the same person no matter what world we 
consider, while President of the company may refer to different persons in 
different worlds. For a more detailed discussion consult, e.g., [Fitting, 1993; 
Kripke, 1980]. 

To describe the syntax and semantics more precisely we briefly introduce 
the modal extension C^ cc of ACC with n unary modal operators Di, . . . , 
and their duals Oi, „ , O n . 

Definition 6.4 (Concepts, roles, axioms) Concepts and roles of C'^ c 
are defined inductively as follows: all concept names are concepts, and if C, 
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D are concepts, R is a role, and O* is a modal operator, then C n D, —> C, 
OiC, and 3R.C are concepts. 2 All role names are roles, and if R is a role, 
then Dji? and O are roles. 

Let C and D be concepts, R a role, and a, b object names. Then ex- 
pressions of the form C = D, R(a,b), and C(a) are axioms. If <p and ip are 
axioms then so are Opp, —up, and <p A ip. 

We remind the reader that models of a propositional modal language are 
based on Kripke frames, i.e. , structures of the form # = (IF, < 1 , . . . , < n ) in 
which each <lj is a binary (accessibility) relation on the set of worlds IF. 
What is going on inside the worlds is of no importance in the propositional 
framework (see, e.g., [Chagrov and Zakharyaschev, 1997] for more informa- 
tion on propositional modal logics). Models of C'\ cc are also constructed 
on Kripke frames; however, in this case their worlds come equipped with 
interpretations of ACC. 

Definition 6.5 (model) A model of C r ^ c based on a frame 5 = 
(IK, <i, . . . , <l n ) is a pair 971 = (#, 1) in which I is a function associating 
with each w €E IF an A£C-interpretation 

I(w) = (A I,W , - I,w ). 

971 has constant domain iff A 1 ^ = A^ w \ for all v, w £ IF. 971 has expanding 
domains iff A 1 ^ C /\R W ) whenever v <], t w, for some i. 

Definition 6.6 For a model 971 = (5, 1 ) and a world w in it, the extensions 
C I,W and R I,W , and the satisfaction relation w |= ip (p an axiom) are defined 
inductively. The interesting new steps of the definition are: 

1. x G (O iC) I,w iff 3v. v\>iiv and x G C I,V ; 

2. ( x,y ) e ( OiR) I,w iff Bv. v >, w and (x,y) € R I,V ; 

3. w 1= O ip iff 3v. v w and v \= <p. 

An axiom <p (resp. a concept C) is satisfiable in a class of models M. if there 
is a model 971 G M and a world w in 971 such that w |= p> (resp. C I,W A 0 )- 

Given a class of frames /C, the satisfiability problems for axioms and concepts 
in 1C are the most important reasoning tasks; others are reducible to them 
(see [Wolter and Zakharyaschev, 1998; 1999b]). Notice that the satisfiability 

2 Note that value restrictions (the modal box operators Eh) need not explicitly be included here 
since they can be expressed using negation and existential restrictions (the modal diamond 
operators O^). 
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problem for concepts is reducible to that for axioms since —>{C = _L) is sat- 
isfiable iff C is satisfiable. Also, the satisfiability problem for models with 
expanding or varying domain is reducible to that for models with constant 
domain (see [Wolter and Zakharyaschev, 1998]). 

We are now going to survey briefly the state of the art in the field. We 
will restrict ourselves first to modal Description Logics which are not tem- 
poral logics. The latter will be considered in Subsection 6.2.4. Chronologi- 
cally, the first investigations of modal Description Logics are [Laux, 1994; 
Graber et al, 1995; Baader and Laux, 1995; Baader and Ohlbach, 1993; 
1995]. The papers [Laux, 1994; Graber et al, 1995] construct multi-agent 
epistemic Description Logics in which the belief operators apply only to 
axioms; the accessibility relations are transitive, serial, and euclidean. The 
decidability of the satisfiability problem for axioms follows immediately from 
the decidability of both the propositional fragment of the logic and ACC, 
because in languages without modalized concepts and roles there is no inter- 
action between the modal operators and role quantification (see [Finger and 
Gabbay, 1992]). Baader and Laux [1995] introduce a Description Logic in 
which modal operators can be applied to both axioms and concepts (but not 
to roles); it is interpreted in models with arbitrary accessibility relations un- 
der the expanding domain assumption. The decidability of the satisfiability 
problem for axioms is proved by constructing a complete tableau calculus. 
This tableau calculus was modified and extended for checking satisfiability 
in models with constant domain in [Lutz et al, 2002], It decides satisfiability 
in constant domain models in NExpTime, which matches the lower bound 
established in [Mosurovic and Zakharyaschev, 1999] (see also [Gabbay et al, 
2003]). 

The papers [Wolter and Zakharyaschev, 1998; 1999a; 1999c; 1999b; 
Wolter, 2000; Mosurovic and Zakharyaschev, 1999] investigate the decision 
problem for various families of modal Description Logics in detail. For ex- 
ample, in [Wolter and Zakharyaschev, 1999c; 1999b] it is shown that the 
satisfiability problem for arbitrary axioms (possibly containing modalized 
roles) is decidable in the class of all frames and in the class of polymodal 
S 5-frames - frames in which all accessibility relations are equivalence rela- 
tions - based on constant, expanding, and varying domains. It becomes un- 
decidable, however, if common-knowledge epistemic operators (in the sense 
of [Fagin et al, 1995]) are added to the language or if the class of frames 
consists of the flow of time (N, <). In [Wolter and Zakharyaschev, 1999a; 
1998] it is shown that for expressive modal languages - like logics with 
common knowledge operators or Propositional Dynamic Logics - the 
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satisfiability problem for axioms becomes decidable when modalized roles 
are not included. Wolter [2000] shows that the satisfiability problem for 
concepts interpreted in frames with global (i.e., world-independent) roles is 
decidable for expressive modal logics based on ACC while the satisfiability 
problem for axioms is undecidable for them. However, even the complexity 
of the satisfiability problem for concepts becomes non-element ary for these 
logics [Gabbay et al, 2003]. In fact, for various decidable modal Description 
Logics only computationally non-element ary decision procedures are known 
and the precise complexity has not yet been determined (consult [Gabbay 
et al, 2003] for further results). 

The papers [Baader and Ohlbach, 1993; 1995] introduce a multi- 
dimensional description language that is even more expressive than C^ cc 
(but without object names). Roughly, in this approach each dimension (ob- 
ject, time, belief, etc.) is represented by a set Di (of objects, moments of 
time, possible worlds, etc.), concepts are interpreted as subsets of the carte- 
sian product nr=i Di, and roles of dimension i as binary relations between 
n-tuples that may differ only in the ith coordinate. One can quantify over 
both roles and concepts, in any dimension. Thus, in contrast to C'\ cc ar- 
bitrarily many dimensions are considered and no dimension is labeled as 
the “modal” or “„4£C”-one. This language has turned out to be extremely 
expressive. The satisfiability problem for the full language is known to be 
undecidable and even for natural fragments no sound and complete reason- 
ing procedures have appeared. Baader and Ohlbach [1995] provide only a 
sound satisfiability-checking algorithm for such a fragment. 


6.2.3 Epistemic operators 

The systems Classic and Loom make it possible for their users to include 
procedural rules into knowledge bases (see also Chapter 2, Subsection 2.2.5). 
Such rules take the form 


C => D. 

where C and D are concepts. The meaning of a procedural rule is different 
from the meaning of an inclusion axiom: while CCD represents conceptual 
knowledge and says that - no matter what is known about individuals - the 
concept D subsumes C, the rule C =$■ D represents the incidental fact that 
“if an individual is known to be an instance of C, then we can conclude that 
it is an instance of D ” . Consider the following example: suppose a knowledge 
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base <5 consists of 

GreatLogician C Professor, ->Professor(a). 

Obviously we can derive ->Greatl_ogician(a) from $. In this representation we 
assume a conceptual relation between the terms ‘professor’ and ‘great logi- 
cian’. More appropriate, however, seems to be the weaker claim that people 
who are known to be great logicians are professors: let T 7 be the knowledge 
base which results from $ when GreatLogician C Professor is replaced with 

GreatLogician => Professor. 

The assertion ->GreatLogician(a) turns out not to be derivable from $ 7 . The 
procedural explanation for this phenomenon is this: in the knowledge base 
<b 7 we do not find an individual belonging to the concept GreatLogician. 
Therefore the rule GreatLogician =>■ Professor does not “fire” and nothing 
new about the world is derivable by using it. However, Description Logic is 
aiming at an extensional semantics for frame-based systems, hence it would 
be desirable to have a precise model-theoretic explanation of the behavior 
of procedural rules as well. 

It turns out that adding an epistemic operator together with a possible 
worlds semantics interpreting it provides us with the required models. In- 
tegrating the operator K - ‘the knowledge base knows that’ - into ACC 
will allow us to rephrase the rule GreatLogician =>• Professor by the inclu- 
sion axiom KGreatLogician C Professor, which says that all objects that are 
known to be great logicians are professors. Actually, it will turn out that 
extensions of Description Logics by means of epistemic operators are useful 
in other contexts as well. We postpone their discussion until we have in- 
troduced some technical prerequisites. We will follow [Donini et ai, 1992b; 
1998a], where the extension of ACC by epistemic operators was introduced 
and investigated. 

Formulated in terms of Subsection 6.2.2, we consider the language C\ cc 
in which the modal operator Di (now denoted by K) can be applied to 
concepts and roles but not to axioms. Following [Donini et ai, 1998a] we 
call this language ACC 1C. The following principles are assumed to govern 
the epistemic operator (we formulate them here for K applied to concepts; 
the formulation for roles is similar): 

• K C C C (only true facts are known: if an object is known to be an instance of 
C, then it is an instance of C); 

• K C C KKC (positive introspection: if it is known that an object is an instance 
of C, then this is known); 

• C C K^KC (negative introspection: if it is not known whether an object is 
an instance of C, then this is known). 
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These principles are valid in all models based on a Kripke frame 5 = (kb, <l) 
iff J is an S5-frame, or, equivalently, if <3 is the universal relation on IT, 
i.e. , <3 = IT x IT. So, we consider frames of the form (IT, IT x IT) only. 

We assume also that: 

• it is known which object an object name denotes (so, object names are assumed 
to be global (or rigid) designators), 

• the set of existing objects A is known and countably infinite (so, we adopt the 
constant domain assumption). 

These assumptions together allow us to simplify the possible worlds seman- 
tics considerably: we can identify the set of worlds IT with a set of inter- 
pretations A4 (all having the same countably infinite domain A and the 
same interpretation of the object names) and the accessibility relation is 
implicitly given as the universal relation on JCi. Hence, we call any set of 
interpretations M satisfying these constraints a model (for ACC 1C) and can 
define the extensions and R x - M of a concept C and a role R in an 


interpretation 2 in 

M as follows: 

a x M = 

A 1 for atomic concepts A 

pT,M = 

P x for atomic roles P 

(-. C) ™ = 

A \ C™ 

(Ci n Cfi) IM = 

cl M n cl M 

( 3R.C )™ = 

{a e A I 36. (a, 6) g R™ A 6 G C™} 

(KC)™ = 

p| C jm ( = {a G A | VJ G M. a G C jm }) 


J£M 

(K R)™ = 

pi R j m ( = {(a, 6) g A | yj g M. (a, 6) e R jm }) 


J&M 


So, KC comprises the set of all objects that are instances of C in every 
world regarded as possible. 

An ACCIC-knowledge base $ consists of a set of inclusion axioms and ABox 
assertions whose concepts and roles are in ACC 1C. A model M. satisfies 
(is a <h-model) iff all inclusion and membership assertions of are true in 
every 2 G M. 

So far, we have introduced a rather simple version of the epistemic ex- 
tensions of ACC discussed in Subsection 6.2.2. In the present subsection, 
however, we are not interested in the satisfiability of epistemic knowledge 
bases, but in a relation |= between knowledge bases and assertions such 
that $ |= ip iff a knowledge base knows p under the assumption that “all 
the knowledge base knows is . For example, if $ is empty (the knowledge 
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base knows nothing), then both — iKC(a) and ->K-i C(a) should be deriv- 
able, since the knowledge base does not know whether a is an instance of 
C or not. On the semantic level this means that we are not interested in 
arbitrary models satisfying $ but only in those <h-models that refute as 
many A£C-assertions as possible. In other words, we consider T-models 
only with as many worlds as possible (corresponding to the intuition that 
more worlds are regarded as possible if less is known). For example, if $ is 
empty, then the intended models comprise all interpretations (with a fixed 
domain and interpretation of the object names), since all interpretations 
are regarded as possible by an empty knowledge base. Here are the precise 
definitions: 

Definition 6.7 An epistemic model for $ is a maximal non-empty set of 
interpretations M. satisfying <b. The knowledge base $ logically implies an 
assertion p, written $ |= p, if every epistemic model At for satisfies p. 

Consequently, |= is a non-monotonic consequence relation: although 0 |= 
(-■K C A -iK-iC')(a), we have C(a ) |= K C{a). On the propositional level, this 
type of reasoning is known as ground non-monotonic S5 (see [Donini et ai, 
1995; 1997c; Nardi and Rosati, 1995]). 

Reasoning with arbitrary A£C/C-knowledge-bases has not been investi- 
gated. In fact, all applications considered in the literature require only very 
small fragments of ACCfC. In what follows, we shall briefly introduce two 
such fragments and some of their applications. 

6.2.3. 1 ACCK, as a query language 

We first confine ourselves to knowledge bases that are ordinary A£C-ABoxes. 
Hence, the epistemic operator K can be used only in queries. Recall that 
concept languages can be applied as query languages in a straightforward 
manner: the answer set of a query consisting of a concept C to a knowl- 
edge base $ comprises the set of individuals a with $ |= C(a). Queries with 
epistemic operators enable us to extract the knowledge which the knowledge 
base has about its own knowledge. Consider, for example, the knowledge base 
$ = {Elfriend.Male(SUSAN)}, which contains incomplete information about 
Susan. Applications of K to different concepts and roles in Elfriend.Male en- 
able us to form a variety of different queries: 

• Elfriend.Male; clearly, the answer to this query is {SUSAN}. 

• Elfriend.KMale; the answer set is empty, since no known male is a friend of Susan. 

• ElKfriend.Male; the answer set is empty since we do not find a male individual 
who is known to be a friend of Susan. 
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• KUfriend.Male; the answer set is {SUSAN} since the knowledge base knows that 
Susan has a friend who is male. 

Observe that, for <h / = 4>U {friend(SUSAN, BOB), Male(BOB)}, the answer 
set would consist of SUSAN in all four cases. We refer the reader to [Donini 
et al. , 1992b; 1998a] for more examples. 

Epistemic queries can also be used to formulate integrity constraints. Re- 
call that integrity constraints can be viewed as epistemic sentences that 
state what a knowledge base must know about the world [Reiter, 1990]. For 
example, suppose that we want to rule out those knowledge bases that are 
uncertain about whether a given course is a course for undergraduates or 
graduates. This can be expressed using the query 

-■KCourse U (KUndergraduate U KGraduate). (6-2) 

A knowledge base satisfies the integrity constraint iff it logically implies 
the assertion (6.2) (a), for every object name a appearing in it. Observe, by 
the way, that the query ->Course U (Undergraduate LI Graduate) has a differ- 
ent meaning: although 0 |= (6.2) (a), for all a (corresponding to the inten- 
tion), 0 [A (-iCourse U (Undergraduate U Graduate))(a). We refer the reader 
to [Levesque, 1984; Lifschitz, 1991; Reiter, 1990] for a discussion of the use 
of epistemic queries in general. 

What is the computational complexity of querying A£C-ABoxes by means 
of A£C/C-concepts? The following result is proved in [Donini et al., 1992b; 
1998a]: 

Theorem 6.8 There is an algorithm for deciding, given an ACC-ABox E, 
an object name a, and an ACC/C- concept C, whether E (= C(a). More pre- 
cisely, the problem E |= C(a) is PSpace -complete (w.r.t. the size of C and 
Sj. 

Recall that querying A£C-ABoxes with A£C-concepts is PSPACE-complete 
as well [Hollunder, 1996]. Thus, the additional epistemic operators in queries 
do not cause any increase of the computational complexity. 

6. 2. 3. 2 Semantics for procedural rules 

To capture the meaning of procedural rules as discussed above (and in Chap- 
ter 2, Section 2.2.5), we must admit assertions of the form KC C D in the 
knowledge base. A rule ABox consists of an A£C-ABox and a set of sen- 
tences of the form 


K C L D, 
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where C , D are A£C-concepts and C is not equivalent to T (the reason for 
this technical condition will be discussed below). 

Fortunately, the additional inclusion axioms again do not lead to any 
increase of the complexity [Donini et al, 1992b; 1998a]. 

Theorem 6.9 There is an algorithm for deciding, given a rule ACC-ABox 
X, an object name a, and an ACC 1C- concept C, whether X j= C(a). More 
precisely, the problem X |= C(a) is PSpace -complete (w.r.t. the size of C 
and Xj. 

Observe that this result does not extend to the language with inclusion 
axioms of the form KC C D, where C is equivalent to T. In this case K C 
would be equivalent to T as well, and so KC C D would be equivalent to 
D = T. However, for knowledge bases with axioms of this type instance 
checking is known to be ExpTlME-complete [Schild, 1994]. Notice that in 
applications a rule of the form T C does not make sense. 

6. 2. 3. 3 An extension of ACC K, 

The non-monotonic logic MKNF is an expressive extension of ground non- 
monotonic S5, which can simulate in a natural manner Default Logic, 
Autoepistemic Logic, and Circumscription (see [Lifschitz, 1994]). This is 
achieved by adding to classical logic not only the operator K (of ground non- 
monotonic S5) but also a second epistemic operator A, which is interpreted 
in terms of autoepistemic assumption. The papers [Donini et al., 1997b; 
Rosati, 1998] study the corresponding bimodal extension of ACC by means 
of K and A, called ACCB in what follows. 

We first consider the two operators K and A separately: the consequence 
relation |= for assertions containing K only is still the one introduced above. 
On the other side, for assertions containing A (‘it is assumed that’) only we 
are interested in a consequence relation \= AE such that 3 $ \= AE ip iff <p 
belongs to every stable expansion of <h, i.e., iff (p belongs to every reason- 
able theory 4 about the world which a rational agent who assumes only the 
assertions in $ can have. In particular, it is assumed that agents are ca- 
pable of introspection. Consider, for example, an agent assuming precisely 
$ = {AC = T} ( “the set of all objects I assume to be in C comprises all 
existing objects”). We still assume that agents know which objects exist (the 

3 AE indicates that autoepistemic propositional logic in the sense of [Moore, 1985] is extended 
here to ACC. 

4 In terms of propositional logic a theory T is called reasonable iff the following conditions 
hold: (0) T is closed under classical reasoning, (1) if P E T, then A P E T, (2) if P 0 T, then 
-<AP E T. 
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constant domain assumption). Hence $ can be rephrased as “I assume that 
all objects belong to C " . Now, according to the autoepistemic approach such 
an agent cannot have a coherent theory about the world, for then she would 
have to assume as well that C = T from the very beginning. 

From the “possible worlds” viewpoint the relation \= AE can be captured 
as follows. Firstly, the extension of ACC by A is interpreted in pairs (X, At ) 
in precisely the same manner as ACCKL. However, now we allow that the 
actual world X is not in At - corresponding to the idea that assumptions 
(in contrast to known assertions) are not always true. Thus we may have 
(A C) X ' M = T but C X ' M A T, which is not possible for K. The intended 
models are called AE-models in what follows. 

Definition 6.10 An AE-model for a set of assertions $ is a set of interpre- 
tations At that satisfies $ and such that, for every interpretation X 0 Af, 
$ is refuted in (X, At). Now put $ \= AE ip iff <p is satisfied in all AE-models 
for $. 

So, we do not maximize the set of possible worlds, but we exclude the case 
that $ is true in an actual world that is not regarded as possible (i.e., is 
not a member of Af). The consequence relation \= AE is also non-monotonic 
since 0 \= AE -iA C(a) but C(a ) |= AC (a). Observe that |= and \= AE are dif- 
ferent: while AC = T has no AE-models, KC = T has the epistemic model 
consisting of all interpretations in which C = T. 

How should one interpret the combined language ACCB and define a 
consequence relation? Following Lifschitz [1994], the intended models (called 
AXCH-models) are defined as follows. 

Definition 6.11 The ACCB-models for a set of AXCH-assertions $ are 
those models Af satisfying $ and the following maximality condition: if a 
non-empty set of new worlds A f is added to Af , K is interpreted in the model 
Af U J\T, and A is interpreted in the old model Af , then $ is refuted in some 
interpretation from A f. Now $ logically implies p, in symbols $ |= <p, iff p 
is satisfied in every ACCB- model satisfying $. 

Thus, roughly speaking, we still maximize the set of worlds, but now we 
require that any larger set of possible worlds contains a world at which <3? is 
refuted under the interpretation of A by means of the original set of possible 
worlds. But this corresponds, for the operator A, to the definition of AE- 
models. Clearly, the new consequence relation is a conservative extension 
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of the one defined for ACC 1C above (and of \= AE as well). Hence using the 
same symbol for both does not cause any ambiguity. 

The new logic is considerably more expressive than ACC 1C. Donini et al., 
[1997b] show that Default Logic can be embedded into ACCB more nat- 
urally than into ACC 1C. They also consider the formalization of integrity 
constraints in knowledge bases, which cannot be expressed in ACC 1C. and 
they discuss how role and concept closure can be formalized in ACCB. Here 
we confine ourselves to a brief discussion of the formalization of integrity 
constraints in ACCB. Above we have seen that the query (6.2) can be used 
to express the constraint that every course known to the knowledge base 
should be known to be for undergraduates or graduates. Sometimes it is 
more useful not to formalize integrity constraints as queries, but as part of 
the knowledge base (see [Donini et al, 1997b]). However, the addition of 
constraints should not change the content of the knowledge base, but just 
force the knowledge base to be inconsistent iff the constraint is violated. 
How can this be achieved in ACCfCl The naive idea is to add the assertion 
(6.2) eT to the knowledge base in order to express the constraint. Un- 
fortunately, this does not work: consider the knowledge base $ consisting 
of Course(a), which does not satisfy the integrity constraint. However, the 
knowledge base obtained from $ by adding (6.2) = T does not tell us that 
the constraint is violated in since the extended knowledge base is still con- 
sistent: the set M consisting of all interpretations J (with a fixed domain 
and interpretation of a) satisfying aA € Course* 7 D Graduate 1 - 7 is an epistemic 
model for the extended knowledge base. In fact, there is no way to formu- 
late the required constraint within ACCK. On the other hand, by adding the 
ACCB- assertion 

KCourse C AGraduate U AUndergraduate 

to d>, we obtain a knowledge base without AXCS-models, as required. Note, 
for example, that the model M. introduced above is not an ACCB - model for 
this knowledge base because any set of worlds M = {X} with X 0 A4 and 
a 1 £ Course 7 refutes the maximality condition. 

Donini et al. [1997b] present a number of decidability results for reasoning 
with ACCB knowledge bases. 


6.2.4 Temporal extensions 

Temporal extensions are a special form of modal extensions of Description 
Logics. However, because of the intended interpretation in flows of time they 
have a specific flavor, which is slightly different from general modal logic. 
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Chronologically, the first example of a “nrodalized” Description Logic was 
the temporal Description Logic of Schmiedel [1990] . The papers [Bettini, 
1997; Artale and Franconi, 1994; 1998] introduce and investigate variants 
of Schmiedel’s formalism. The papers mentioned so far employ an interval- 
based, approach to the semantics of temporal operators. Point-based tem- 
poral Description Logics have been introduced by Schild [1993] and further 
investigated by Wolter and Zakharyaschev [l999e]. 

For simplicity, let us first consider propositional temporal logic and then 
see how it can be extended to temporal Description Logic. In what follows 
we assume that a flow of time T = (T, <) consists of a set of points in time 
T and a precedence relation < between points in time which is assumed to 
be a strict linear order. This corresponds to the intuition that, for any two 
moments tifl -2 G T, either t\ precedes t 2 , t ,2 precedes t\, or t\ equals A 

How should one define a satisfiability relation |= between entities in a flow 
of time and formulae? There exist (at least) two different ways to select the 
entities at which formulas are evaluated: points in time and intervals. While 
in the first case we are considering a relation t (= cp between time-points t 
and formulas <p, in the second case we have a relation \u, v\ j= ip between 
intervals [u, v\ = {z G T \ u < z < v}, where u < v, in T and formulae <p. 
Denote by T* the set of all intervals in X. Both point- and interval-based 
temporal logics are special instances of modal logics: in the former the worlds 
of Kripke frames are interpreted as time-points while in the latter they are 
interpreted as intervals. Both point- and interval-based temporal models are 
easily extended to temporal ACC-models: 

Definition 6.12 A point-based temporal ACC-model 911= (X, I) consists 
of a flow of time X and a function I which associates with every t G T an 
interpretation 

I(t) = ( A 7 -*,- 7 -*). 

An interval-based temporal ACC-model 911 = (T, I) consists of a flow of time 
T and a function / which associates with every interval t £ X* an interpre- 
tation 

I(i) = (A 7 ?, 

We can now evaluate ACC-concepts and axioms in point- and interval- 
based temporal models. For example: 

• (9ft, t) \= Alive(a) iff a 1 ' 1 G Alive 1 ' 4 , i.e. , a is alive at moment t 

• (91t, i) ]= Sleep(a) iff a 1 ’ 1 G Sleep 7 ’*, i.e., a is sleeping in the interval i. 
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We now add temporal operators and quantifiers to ACC, which enable us to 
relate different moments and intervals to each other. 

For the point-based approach we have discussed appropriate operators 
already: we can form the language C^^c and interpret the operator □ = 
□ i as “ always in the future" . Thus, t |= □(C' = D) iff t' \= C = D for all 
t! > t, (always in the future of t, C and D are interpreted as the same set), 
and x £ (OC) 1 ^ iff there exists t' > t such that x £ C 1,1 (eventually x is 
an instance of C ). Often, however, more expressive temporal operators are 
required. The operator U (until), for example, is a binary temporal operator 
with the following truth-conditions, for all concepts C, D and axioms (p, if: 

1. a: £ ( CUD ) 7,4 iff there exists t' > t such that x £ D 7,4 and, for all t" with 
t<t"< t' , x £ C 7 ’ 4 ", 

2. t \= i pUif iff there exists t! > t, such that tf |= if and, for all t" with t <t" < t' , 

t" H v- 

In this language we can define a mortal as, say, a living being that is alive 
until it dies: 

Mortal = LivingBeing n (LivingBeing U □-iLivingBeing). 

This language, interpreted in the flow of time (N, <), was first considered 
by Schild [1993], who showed that the satisfiability problem for concepts 
(without modalized or global roles) is decidable. Wolter [2000] proves the 
decidability for concepts with global roles (but without modalized roles). 
However, the complexity of the decision problem for this language is non- 
elementary [Gabbay et al, 2003]. Wolter and Zakharyaschev [l999e] prove 
that even for axioms the satisfiability problem is decidable, provided that 
they do not contain modalized or global roles. Tableau calculi (running in 
double-exponential time) for the case of expanding and constant domains 
were developed in [Sturm and Wolter, 2002; Lutz et al . , 2001b]. The satisfi- 
ability problem for axioms in the full language with the flow of time (N, <) 
is undecidable. 

For the interval-based approach we find both languages that extend ACC 
by means of temporal operators which are interpreted by accessibility re- 
lations between intervals [Bettini, 1997] and languages that allow explicit 
quantification over intervals [Schmiedel, 1990; Artale and Franconi, 1994; 
1998]. 

We start the discussion with the temporal operators approach. Bet- 
tini [1997] extends the propositional interval-based temporal logic of 
[Halpern and Shoharn, 1991] to ACC (and weaker Description Logics). Thus, 
given a concept C, we can now form new concepts like (starts) C and 
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( finishes ) C. They are interpreted in interval-based models (T, I) as follows: 

• x £ ((starts) iff 3t £ T. u < t < v A x € C I ^ u,i 1 

(a; is an instance of ( starts ) C in the interval [u, t>] iff x is an instance of C in some 
interval starting [it, i>]), 

• x £ ((finishes) C') /, [“’d iS 3t £ T. u < t < v A x £ C 

In other words, the modal operators (starts) and (finishes) are interpreted in 
the standard “possible worlds manner” by means of the accessibility relations 
“starts” and “ finishes ”, respectively, where (■ i,j ) E starts iff j starts z and 
(i, j) £ finishes if j finishes i. By adding the converse operators of (starts) 
and (finishes) to the language, we obtain a language that can express all the 
thirteen Allen relations between intervals [Allen, 1983]. Here is a definition 
of Mortal in this language: 

Mortal = LivingBeing n (after) ->LivingBeing. 

Unfortunately, for the full language based on ACC the satisfiability prob- 
lem for concepts is undecidable in all interesting flows of time. This follows 
from the fact that propositional interval-based temporal logic is undecid- 
able already in (M, <}, (Q, <), (N, <}, etc. (see [Halpern and Shoharn, 1991]). 
However, there are numerous open decision problems when Description Log- 
ics weaker than ACC and different notions of intervals are considered (see 
[Bettini, 1997; Artale and Franconi, 2000; 2001]). 

Now, let us consider interval-based temporal extensions of Description 
Logics that allow explicit quantification over intervals. Schmiedel [1990] de- 
velops an expressive formalism 5 in which we have two quantifiers D(z) (“for 
all intervals i”) and O (i) (“there exists an interval z”), where z is a variable 
ranging over intervals. The language does not contain negation so that the 
quantifiers are not mutually definable. The quantifiers are relativized (alias 
bounded or guarded) by so-called time nets , which can, for example, be 
some relations like starts or finishes between intervals (metric and granu- 
larity constraints are admitted as well) . An operator @ specifies the interval 
at which a concept applies to an object and fl denotes a reference interval. 
The following concept can be regarded as a definition of the concept Mortal 
in Schmiedel’s language: 

LivingBeing n (O(z) (after z Jj) ( — i Livi ngBei ng @ z)). 

Here (after z ft) is the time net which relativizes the quantifier O(z) by means 
of the constraint expressing that z must be after the reference interval de- 
noted by ft. According to this definition, an object x is an instance of Mortal 


* Here and in what follows we use the notation of [Artale and Franconi, 1998]. 



262 


F. Baader, R. Kiisters, and F. Wolter 


at the reference interval fl iff x is living at jj and there exists an interval i 
that is after fj, and at which x is not living. 

Schmiedel [1990] does not address computational problems for his lan- 
guage. However, it is not difficult to see that, in the presence of negation, this 
language is more expressive than that of Bettini [1997] considered above - 
and thus subsumption is undecidable for all interesting flows of time. The 
decision problem for the language without negation appears to be open. 

A brief remark concerning the relation between interval-based temporal 
logic with and without explicit quantification over intervals is in order. Of 
course, explicit quantification provides more expressive power. Using the 
temporal operators introduced above, it is not possible to represent re- 
lations between more than two intervals because referring to a fixed ref- 
erence interval is impossible. On the other hand, variable-free languages 
are much closer in spirit to pure Description Logics and therefore seem 
to be more natural candidates for temporalizations of Description Log- 
ics; we refer the reader to [Artale and Franconi, 2000; 2001] for a detailed 
discussion. 

The papers [Artale and Franconi, 1994; 1998] present a number of lan- 
guages weaker than Schmiedel’s with a decidable subsumption problem. 
Among others, they define a temporal extension of a Description Logic ex- 
tending ACC with functional roles. They show decidability of concept sub- 
sumption and PSPACE-completeness of satisfiability w.r.t. an empty KB in 
an unbounded and dense flow of time. The main reason for the decidability 
is that the language does not admit universal quantification over intervals 
and that the constructors of the underlying Description Logic cannot be 
applied to the temporalized part of the language. In particular, the negation 
of the underlying Description Logic cannot be used to define the universal 
quantifier by means of the existential one. The authors show by means of a 
number of examples that their formalism still has enough expressive power 
to represent non-trivial actions and plans. 

An interesting feature of the subsumption algorithm presented by Artale 
and Franconi [1998] is that it consists of two parts: firstly, a normalization 
procedure is employed to reduce the subsumption problem for the temporal- 
ized Description Logic to that problem for the pure Description Logic, which 
can then be solved with known algorithms [Hollunder and Nutt, 1990]. 

For a more detailed survey of the state of art in temporal Description Logic 
we refer the reader to [Artale and Franconi, 2000; 2001], where one can also 
find an introduction to the work of Weida and Litrnan [1992], who propose a 
loose hybrid integration between Description Logics and constraint networks 
with the aim of reasoning about plans. 
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6.2.5 Representing uncertain and vague knowledge 

Description Logics whose semantics is based on classical first-order logic 
cannot express vague or uncertain knowledge. To overcome this deficiency, 
approaches for integrating probabilistic logic and fuzzy logic into Descrip- 
tion Logics have been proposed. Although both types of approaches assign 
numerical values to entries in the knowledge base, they are quite different, 
not only from a technical point of view, but also w.r.t. the basic phenomena 
they are trying to model. We talk about uncertainty if we deal with propo- 
sitions that are either true or false, but due to a lack of information we do 
not know for certain which is the case. This gives rise to statements about 
the probability with which a proposition is assumed to be true. In contrast, 
vagueness means that the propositions themselves are only true to a certain 
degree. This vagueness is not caused by incomplete knowledge; it is due to 
the fact that fuzzy notions, i.e. , notions without crisp boundaries (e.g., tall 
person) are modeled. 

In the following, we will restrict our attention to the probabilistic ex- 
tensions of Description Logics introduced in [Heinsohn, 1994; Jaeger, 1994; 
Roller et al, 1997; Yelland, 2000] and the fuzzy extensions of Description 
Logics introduced in [Yen, 1991; Tresp and Molitor, 1998; Straccia, 1998; 
2001]. The possibilistic extension by Hollunder [1994b] can be viewed as 
lying between these two approaches: possibilistic logic is mainly used to 
model uncertainty, but its formal semantics is defined in terms of fuzzy sets 
of interpretations. 


6.2.5. 1 Probabilistic extensions 

Let us first concentrate on how to extend the terminological (TBox) for- 
malism. In classical Description Logics, one has very restricted means of 
expressing (and testing for) relationships between concepts. Given two con- 
cepts C and D, subsumption tells us whether C is contained in D, and the 
satisfiability test (applied to C n D) tells us whether C and D are disjoint. 
Relationships that are in between (e.g., 90% of all Cs are Ds) can neither 
be expressed nor be derived. 

This deficiency is overcome in [Heinsohn, 1994; Jaeger, 1994] by allowing 
probabilistic terminological axioms of the form 6 

P (C\D)=p, 

where C, D are concept descriptions and 0 < p < 1 is a real number. Such 
an axiom states that the conditional probability for an object known to be 

6 Actually, Heinsohn uses a different notation and allows more expressive axioms stating that 
P(C\D) belongs to an interval \pi,p u \, where 0 < pi < p u < 1. 
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in D to belong to C is p. A given finite interpretation X satisfies P(C\D) = p 
iff 


\(CnDf 

\D X \ 


= p. 


More generally, the formal semantics of the extended language is defined in 
terms of probability measures on the set of all concept descriptions (modulo 
equivalence) . 

Given a knowledge base V consisting of probabilistic terminological ax- 
ioms, the main inference task is then to derive optimal bounds for additional 
conditional probabilities. Intuitively, 


V\=P(C\D)e\p,q} 

iff in all probability measures satisfying V the conditional probability 
P(C'I-D) belongs to the interval \p,q\. Given V,C,D , one is interested in 
finding the maximal p and minimal q such that V |= P(C\D ) £ [p, q] is true. 

Heinsohn [1994] introduces local inference rules that can be used to derive 
bounds for conditional probabilities, but these rules are not complete, that 
is, in general they are not sufficient to derive the optimal bounds. 

Jaeger [1994] only describes a naive method for computing optimal 
bounds. A more sophisticated version of that method reduces the inference 
problem to a linear optimization problem. In the following, we will sketch 
the main idea underlying this reduction. Assume that C \, . . . , C m are the 
concept descriptions occurring in V and P(C\D), and consider all conjunc- 
tions D\ n • • • n D m , where Di is either C t or -> C*. Let 21 be the set of those 
conjunctions that are satisfiable. Given a probability measure on all concept 
descriptions, the value of this measure on C\, . . . . C m is uniquely determined 
by the value on 21. To be more precise, its value for C\ can be obtained as 
the sum of the values for those elements of 21 that are subsumed by C\ (i.e. , 
the ones where C\ occurs positively). The idea is to introduce a numerical 
variable xt (ranging over the real interval (0,1)) for each element t £ 21. 
For example, if C \ , C 2 are two concept names, then 21 consists of the four 
elements to = ~'C\ I~1 — 1C2, ti = ~~ 'Ci I~1 C 2 , t .2 = C\ n ~>C 2 , and 1 3 = C \ n C 2 , 
for which we introduce the variables xo,xi,X 2 ,xs, respectively. Thus, the 
probability associated with Ci n C2 is X3 and the one for C2 is x\ + X3. 
Consequently, the probabilistic terminological axiom P(C'i|C , 2) = 0.7 can be 
represented by the (linear) constraint X 3 = 0.7(xi + £3). 

We have to find the maximal and minimal values that P(C'|D) attains on 
the set of values (xq, . . . , x n ) satisfying the linear constraints induced by V. 
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The value of the function P(CjD) (in terms of the variables xf) is given by 


^2{xt IteSlAtcCn D} 

| t G 21 A t E D} 

By a simple transformation, this fractional optimization problem can be 
transformed into a linear optimization problem [Amarger et al, 1991]. 

Jaeger [1994] also extends the assertional formalism by allowing proba- 
bilistic assertions of the form 


P(C'(o))=p, 

where C is a concept description, a an individual name, and p a real num- 
ber between 0 and 1. It should be noted that this kind of probabilistic 
statement is quite different from the one introduced by the terminological 
formalism. Whereas probabilistic terminological axioms state statistical in- 
formation , which is usually obtained by observing a large number of objects, 
probabilistic assertions express a degree of belief in assertions for specific in- 
dividuals. The formal semantics of probabilistic assertions is again defined 
with the help of probability measures on the set of all concept descriptions, 
one for each individual name. Intuitively, the measure for a tells us for each 
concept C how likely it is (believed to be) that a belongs to C. 

Given a knowledge base V consisting of probabilistic terminological ax- 
ioms and assertions, the main inference task is now to derive optimal bounds 
for additional probabilistic assertions. However, if the probabilistic termino- 
logical axioms are supposed to have an impact on this inference problem, 
the semantics as sketched so far is not sufficient. In fact, to date there is no 
connection between the probability measure used for the terminological part 
and the measures for the assertional part. Intuitively, one wants the measures 
for the assertional part to “most closely resemble” the measure for the termi- 
nological part, while not violating the probabilistic assertions. Jaeger [1994] 
uses cross entropy minimization in order to give a formal meaning to this 
intuition. To date, there is no algorithm for computing optimal bounds for 
P(C(a)), given a knowledge base consisting of probabilistic terminological 
axioms and assertions. 

The work reported in [Koller et al., 1997], which is restricted to the ter- 
minological component, has a focus that is quite different from the one in 
[Heinsohn, 1994; Jaeger, 1994], In the latter work, the probabilistic termi- 
nological axioms provide constraints on the set of admissible probability 
measures. However, these constraints may still be satisfied by a large set 
of distributions, and hence the optimal interval entailed for the probabili- 
ties of interest can be fairly large. In contrast, Koller et al. [1997] present a 
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framework for the specification of a unique probability distribution on the 
set of all concept descriptions (modulo equivalence). Since there are infinitely 
many such descriptions, providing such a (finite) specification is a nontrivial 
task. The basic idea is to specify a distribution on concepts of role-depth 0, 
and then to specify how to extend a distribution on concepts of role-depth n 
to one on concepts of role-depth n + 1. Koller et al. [1997] employ Bayesian 
networks as the basic representation language for the required probabilistic 
specifications. The probability P(C) of a concept description C can then be 
computed by using inference algorithms developed for Bayesian networks. 
The complexity of this computation is linear in the length of C. Under cer- 
tain restrictions on the Bayesian networks used in the specification, it is 
polynomial in the size of that specification. 

Yelland [2000] also combines Bayesian networks and Description Logics. 
In contrast to [Koller et al., 1997], this work extends Bayesian networks by 
DL features rather than the other way round. The Description Logic used 
in [Yelland, 2000] is rather inexpressive, but this allows the author to avoid 
restrictions on the network that had to be imposed by Koller et al. [1997]. 

6. 2. 5. 2 Fuzzy extensions 

The concepts in Description Logics are interpreted as crisp sets, i.e., an in- 
dividual either belongs to the set or not. However, many “real-life” concepts 
are vague in the sense that they do not have precisely defined membership 
criteria. Consider, for example, the concept of a tall person. It does not make 
sense to fix an exact boundary such that persons of height larger than this 
boundary are tall and others are not. In fact, what about a person whose 
height is 1 millimeter below the boundary? It is more sensible to say that 
an individual belongs to the concept “tall person” only to a certain degree 
n £ [0,1], which depends on the height of the individual. This is exactly 
what fuzzy logic allows one to do. 

The main idea underlying the fuzzy extensions of Description Log- 
ics proposed in [Yen, 1991; Tresp and Molitor, 1998; Straccia, 1998; 
2001] is to leave the syntax as it is, but to use fuzzy logic for defining the se- 
mantics. Thus, an interpretation now assigns fuzzy sets to concepts and roles, 
i.e., concept names A are interpreted by membership degree functions of the 
form A x : A x — > [0, 1], and role names R by membership degree functions of 
the form R 1 : A 1 x A 1 — > [0, 1]. The interpretation of the Boolean operators 
and the quantifiers must then be extended from {0, 1} to the interval [0, 1]. 
Fuzzy logics provide different options for such an extension. In [Yen, 1991; 
Tresp and Molitor, 1998; Straccia, 1998; 2001], the usual interpretation of 
conjunction as minimum, disjunction as maximum, negation as Ax.(l — x ), 
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universal quantifier as infimum, and existential quantifier as supremum is 
considered. For example, 

( yR.C) z (d ) = inf{max{l — R 1 (d, e) , C 1 (d, e)} \ e £ A 1 }, 

since \/R.C corresponds to the formula Vx.(-i R(x,y) V C(y)). 

Tresp and Molitor [1998] also propose an extension of the syntax by 
so-called manipulators, which are unary operators that can be applied to 
concepts. Examples of manipulators could be “mostly”, “more or less”, or 
“very” . For example, if Tall is a concept (standing for the fuzzy set of all tall 
persons), then VeryTall, which is obtained by applying the manipulator Very 
to the concept Tall, is a new concept (standing for the fuzzy set of all very 
tall persons). Intuitively, the manipulators modify the membership degree 
functions of the concepts they are applied to appropriately. In our exam- 
ple, the membership function for VeryTall should have its largest values at 
larger heights than the membership function for Tall. Formally, the seman- 
tics of a manipulators is defined by a function that maps membership degree 
functions to membership degree functions. The manipulators considered in 
[Tresp and Molitor, 1998] are, however, of a very restricted form. 

Lets us now consider what kind of inference problems are of interest in 
this context. Yen [1991] considers crisp subsumption of fuzzy concepts, i.e. , 
given two concepts C, D defined in the fuzzy Description Logic, he is inter- 
ested in the question whether C X {d) < D I (d) for all fuzzy interpretations I 
and d £ A 1 . Thus, the subsumption relationship itself is not fuzzified. He 
describes a structural subsumption algorithm for a rather small fuzzy De- 
scription Logic, which is almost identical to the subsumption algorithm for 
the corresponding classical Description Logic. In contrast, Tresp and Moli- 
tor [1998] are interested in determining fuzzy subsumption between fuzzy 
concepts, i.e., given concepts C, D, they want to know to what degree C 
is a subset of D. In [Straccia, 1998; 2001] and [Molitor and Tresp, 2000], 
also ABoxes are considered, where the ABox assertions are equipped with a 
degree. In this context one wants to find out to what degree other assertions 
follow from the ABox. 

Both [Straccia, 1998; 2001] and [Tresp and Molitor, 1998] contain com- 
plete algorithms for solving these inference problems in the respective fuzzy 
extension of ACC. Although both algorithms are extensions of the usual 
tableau-based algorithm for ACC, they differ considerably. For example, 
the algorithm in [Tresp and Molitor, 1998] introduces numerical variables 
for the degrees, and produces a linear optimization problem, which must be 
solved in place of the usual clash test. In contrast, Straccia deals with the 
membership degrees within his tableau-based algorithm. 
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6.2.6 Extensions by default rules 

In Description Logics, inclusion axioms of the form CCD are interpreted 
as universal statements, i.e., all instances of C also belong to D. The same 
is true for inferred subsumption relationships. In commonsense reasoning, 
however, one often wants to state and infer relationships that are only “nor- 
mally” true, but may have exceptions. The most prominent example from 
the non-monotonic reasoning community is the statement that all birds fly; 
but of course penguins and other non- flying birds are exceptions. Allowing 
such default statements has a strong impact both on the semantics and the 
reasoning capabilities of Description Logics. Instead of basing the semantics 
on classical first-order logic, one must employ a non-monotonic logic [Gins- 
berg, 1987]. In fact, conclusions drawn from a given knowledge base with 
defaults may ultimately turn out to be false when additional knowledge is 
added, and thus must be withdrawn. 

Since most of the classical Description Logics can be seen as fragments 
of first-order predicate logic, an obvious approach for extending Descrip- 
tion Logics by non-nronotonic reasoning capabilities is to take one of the 
well-known non-nronotonic logics, and restrict the first-order version of this 
logic to the Description Logic in question. This approach was employed in 
[Baader and Hollunder, 1995a], where Reiter’s default logic [Reiter, 1980] is 
integrated into Description Logics. In addition to terminological axioms in 
the TBox and assertions in the ABox, Baader and Hollunder allow termi- 
nological defaults of the form 


C(x) : D(x) 

E{x) ’ 

where C, D,E are concept descriptions (viewed as first-order formulae with 
one free variable x). Intuitively, such a default rule can be applied to an 
ABox individual a, i.e., E{a) is added to the current set of beliefs, if its 
prerequisite C(a) is already believed for this individual and its justification 
D{a) is consistent with the set of beliefs. Formally, the consequences of a 
terminological default theory (consisting of a TBox, an ABox, and a set 
of terminological defaults) are defined with reference to the notion of an 
extension , which is a set of deductively closed first-order formulae defined 
by a fixpoint construction (see [Reiter, 1980]). In general, a default theory 
may have more than one extension, or even no extension. Depending on 
whether one wants to employ skeptical or credulous reasoning, an assertion 
F(a) is a consequence of a default theory iff it is in all extensions or if it is 
in at least one extension of the theory. 
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It should be noted that in this setting the application of default rules 
is restricted to individuals explicitly present in the ABox." For example, 
assume that the ABox consists of the fact that Tom has a child that is a 
doctor, i.e. , A = {(3has-child.Doctor)(TOM)}, and that by default we assume 
that doctors are usually rich: 

Doctor(x) : Rich(x) 

Rich(x) 

Intuitively, one might expect that (3has-child.Rich)(TOM) is a default conse- 
quence of this terminological default theory. However, since the ABox does 
not contain a name for Tom’s child, the default cannot be applied to this 
“implicit” individual, and thus one cannot conclude that Tom has a rich 
child by default. Baader and Hollunder [1995a] give two reasons that jus- 
tify restricting the application of defaults to explicit individuals. From a 
semantic point of view, adapting Reiter’s treatment of implicit individuals 
via skolenrization is quite unsatisfactory, since semantically equivalent (but 
syntactically different) ABoxes may lead to different default consequences. 
From the algorithmic point of view, the application of defaults to implicit 
individuals is problematic since it may lead to an undecidable default conse- 
quence relation, even though the Description Logic employed is decidable. In 
contrast, the restriction of default application to explicit individuals ensures 
that reasoning in terminological default theories stays decidable whenever 
reasoning in the underlying Description Logic is decidable. 

A major drawback, which terminological default logic inherits from gen- 
eral default logic, is that it does not take precedence of more specific defaults 
over more general ones into account. For example, assume that we have a de- 
fault that says that doctors are usually rich, and another one that says that 
general practitioners are usually not rich, and that classification shows that 
general practitioners are doctors. Intuitively, for any general practitioner 
the more specific second default should be preferred, which means that 
there should be only one default extension, in which the general practitioner 
is not rich. However, in default logic the second default has no priority over 
the first one, which means that one also gets a second extension where the 
general practitioner is rich. This behavior has already been criticized in the 
general context of default logic, but it is all the more problematic in the 
terminological case where the emphasis lies on the hierarchical organization 
of concepts. To overcome this problem, Baader and Hollunder [1995b] 
first define a prioritized version of Reiter’s default logic, where priorities 

7 This agrees with the semantics given to (monotonic) rules in Description Logics (see Subsec- 
tion 6.2.3 and Chapter 2, Subsection 2.2.5). 
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are given by an arbitrary partial order on defaults. In the terminological 
case, the priority is induced by the subsumption relationship between 
prerequisites of defaults. A similar approach is proposed in [Straccia, 1993], 
with the main difference that in that paper the defaults also influence the 
priority order. In addition, Straccia also allows defaults of the form 

A(x) A r(x, y) : C(y ) 

C{y) 

where A is an atomic concept, r a role name, and C a concept description. 
Such a default can, for example, be used to say that usually a child of a 
doctor is again a doctor. 

A quite different proposal for how to treat defaults in Description Log- 
ics can be found in [Quantz and Royer, 1992]. There, preference semantics 
[Shoharn, 1987] is employed to define the semantics of default assertions 
C ^ D, which are intuitively interpreted as saying: “whenever an object is 
an instance of C. it is also an instance of D, unless this is in conflict with 
other knowledge” . Though on this intuitive level the meaning of the default 
C ^ D coincides with that of the terminological default C(x) : D(x)/D(x), 
the formal semantics (and thus also the default consequences) differ signifi- 
cantly. The semantics proposed by Quantz and Royer is based on a prefer- 
ence relation on models, which tries to minimize the exceptions to defaults 
while maximizing the number of defaults that have been fired. In contrast to 
the work mentioned above, Quantz and Royer do not restrict reasoning with 
defaults to the derivation of concept assertions of the form C(a). They also 
consider default subsumption between concepts. However, default subsump- 
tion is reduced to reasoning about individuals. The subsumption relation- 
ship C D follows by default from the knowledge base iff the knowledge 
base extended by C(a) implies D(a) by default, where a is a new individ- 
ual name. Designing reasoning methods for such a model-based approach 
to non-monotonic reasoning is rather hard. Quantz and Royer only provide 
some ideas for how to obtain a sound but incomplete procedure. 

Default subsumption is also considered in [Padgham and Zhang, 1993], 
where non-monotonic inheritance networks [Horty et al, 1987] are extended 
in the direction of Description Logics, though the Description Logic em- 
ployed is of a very limited expressive power. 


6.3 Non-standard inference problems 

All DL systems provide their users with standard inference services like 
computing the subsumption hierarchy and testing ABox consistency. In some 
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applications it has turned out, however, that these services are not quite 
sufficient for providing an optimal support when building and maintaining 
large DL knowledge bases. For this reason, some DL systems (e.g., Classic) 
provide their users with additional system services, which can formally be 
reconstructed as new types of inference problems. 

First, the standard inferences can be applied after a new concept has been 
defined to find out whether the concept is non-contradictory or whether its 
place in the taxonomy coincides with the intuition of the knowledge engi- 
neer; however, these inferences do not directly support the process of actu- 
ally defining the new concept. To overcome this problem, the non-standard 
inference services of computing the least common subsumer and the most 
specific concept have been proposed. 

Second, if a knowledge base is maintained by different knowledge 
engineers, one needs support for detecting multiple definitions of the same 
intuitive concept. Since different knowledge engineers might use different 
names for the “same” primitive concept, the standard equivalence test may 
not be adequate to check whether different descriptions refer to the same no- 
tion. The non-standard inference service unification of concept descriptions 
tackles this problem by allowing concept names to be replaced by appropri- 
ate concept descriptions before testing for equivalence. Matching is a special 
case of unification, which has, for example, been used for pruning irrelevant 
parts of large concept descriptions before displaying them to the user. 

Third, and very abstractly speaking, rewriting of concept descriptions 
allows one to transform a given concept description C into a “better” de- 
scription D, which satisfies certain optimality criteria (e.g., small size) and is 
in a certain relationship (e.g., equivalence or subsumption) with the original 
description C. 

Before describing the different non-standard inferences in more detail, we 
start with some general remarks on how these new problems have until now 
been tackled in the literature. An overview of the state of the art in this 
field and detailed proofs of several of the results mentioned below can be 
found in [Kiisters, 2001]. 


6.3.1 Techniques for solving non-standard inferences - a 

general remark 

Approaches for solving the new inference problems are usually based on an 
appropriate characterization of subsumption, which can be used to obtain a 
structural subsumption algorithm. First, the concept descriptions are turned 
into a certain normal form, in which implicit facts have been made explicit. 
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Second, the structure of the normal forms is compared appropriately. This 
is one of the reasons why most of the results on non-standard inferences 
are restricted to languages that can be treated by structural subsumption 
algorithms. 

One can distinguish two kinds of normal forms proposed in the literature. 
In one approach, called language-based approach in the sequel, the normal 
form of a concept description is given in terms of certain finite or regular sets 
of words over the alphabet of all role names. Then, subsumption can be char- 
acterized via the inclusion of these sets (see Chapter 2, Subsection 2. 3. 3. 2). 
The second approach, called graph-based in the following, turns concept 
descriptions into so-called description graphs. Here, subsumption of concept 
descriptions is characterized via the existence of certain homomorphisms 
between the corresponding description graphs. The structural subsumption 
algorithm introduced in Chapter 2, Subsection 2.3.1, can be represented in 
this way (although this was not explicitly done in Chapter 2). 

For the sublanguage ACM of Classic, the graph-based approach can be 
seen as a special implementation of the language-based approach [Baader 
et ai, 1998a]. In general, however, either the language-based or the graph- 
based approach may turn out to be more appropriate, depending on the De- 
scription Logic under consideration. On the one hand, the language-based 
approach is particularly useful for characterizing subsumption between cyclic 
concept descriptions, i.e., descriptions defined by means of cyclic terminolo- 
gies in F'Cq and ACM [Baader, 1996b; Kiisters, 1998]. On the other hand, the 
graph-based approach can be employed to handle full Classic [Borgida and 
Patel-Schneider, 1994] as well as ACS [Baader et ai, 1999b], which extends 
J-Cq by primitive negation and existential restrictions. Although Borgida 
and Patel-Schneider did not explicitly characterize subsumption in terms of 
homomorphisms between description graphs, their subsumption algorithm 
does in fact check for the existence of an appropriate homomorphism. 

The known approaches for solving non-standard inference problems are 
usually based on one of the two approaches for characterizing subsumption, 
depending on the Description Logic of choice. In the sequel, we will give 
an idea of how to solve the inference problems by mainly looking at the 
language-based approach for the Description Logic !FCq. We will also briefly 
comment on how to treat extensions of J-Cq. 


6.3.2 Least common subsumer and most specific concept 

Intuitively, the least common subsumer of a given collection of concept de- 
scriptions is a description that represents the properties that all the elements 
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of the collection have in common. More formally, it is the most specific con- 
cept description that subsumes the given descriptions: 

Definition 6.13 Let £ be a description language. A concept description 
E of £ is the least common subsumer (lcs) of the concept descriptions 
Ci , . . . , C n in £ (7cs(Ci, . . . , C n ) for short) iff it satisfies: 

1. Ci C E for all i = 1, . . . , n, and 

2. E is the least £-concept description satisfying (i), i.e., if E' is an £-concept 
description satisfying Ci C E' for all i = 1, . . . , n, then E C E' . 

As an easy consequence of this definition, the lcs is unique up to equiva- 
lence. In fact, if E\ and are both least common subsumers of the same 
collection of concepts, then E\ C E 2 (since E 2 satisfies (i) and E \ is the least 
concept description satisfying (i)). The subsumption relationship E 2 E E\ 
can be derived analogously. It should be noted, however, that the lcs need 
not always exist. This can have two different reasons: (a) there may be sev- 
eral subsumption incomparable minimal concept descriptions satisfying (i) 
of the definition; (b) there may be an infinite chain of more and more spe- 
cific descriptions satisfying (i). It is easy to see, however, that for Description 
Logics allowing conjunction of descriptions (a) cannot occur. 

The lcs was first introduced by Cohen et al. [1992] as a new inference task 
that is useful for a number of different reasons. First, finding the most spe- 
cific concept that generalizes a set of examples is a common operation in in- 
ductive learning, called learning from examples. Cohen and Hirsh [1994a] as 
well as Frazier and Pitt [1994] investigate the learnability of sublanguages of 
Classic with regard to the PAC learning model proposed by Valiant [1984]. 
The lcs-computation is used as a subprocedure in their learning algorithms. 
Experimental results concerning the learnability of concepts based on com- 
puting the lcs can be found in [Cohen and Hirsh, 1994b]. 

Another motivation for considering the lcs is to use it as an alternative to 
disjunction. The idea is to replace disjunctions like C\ U • • • U C n by the lcs 
of Ci, . . . , C n . In [Cohen et al., 1992; Borgida and Etherington, 1989], this 
operation is called knowledge base vinification. Although, in general, the lcs is 
not equivalent to the corresponding disjunction, it is the best approximation 
of the disjunctive concept within the available Description Logic. The use of 
such an approximation is motivated by the fact that, in many cases, adding 
disjunction would increase the complexity of reasoning. Observe that, if 
the Description Logic already allows disjunction, we have Ics(Ci, . . . , C n ) = 
Ci U • • • U C n . In particular, this means that, for such Description Logics, 
the lcs is not really of interest. 
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Finally, as proposed in [Baader and Kiisters, 1998; Baader et al, 1999b], 
the lcs operation can be used to support the “bottom-up” construction of 
DL knowledge bases. In contrast to the usual “top-down” approach, where 
the knowledge engineer first defines the terminology of the application do- 
main in the TBox and then uses this terminology when describing indi- 
viduals in the ABox, the “bottom- up” approach proceeds as follows. The 
knowledge engineer first specifies some “typical” examples of a concept to 
be defined using individuals in the ABox. Then, in a second step, these 
individuals are generalized to their most specific concept, i.e. , a concept 
description that (i) has all the individuals as instances, and (ii) is the 
most specific description satisfying property (i). Finally, the knowledge en- 
gineer inspects and possibly modifies the concept description obtained this 
way. 

Let us now define the most specific concept of an ABox individual in more 
detail. 

Definition 6.14 A concept description E in some description language C 
is the most specific concept (msc) of the individuals a\, . . . ,a n defined in an 
ABox A (msc(a i, . . . , a n ) for short) iff: 

1 . A \= E(af) for all* = 1, ... , n, and 

2. E is the least concept satisfying (i), i.e., if E' is an ^-concept description 
satisfying A |= Efiaf) for all i = 1, . . . , n, then E C E' . 

The task of computing the msc can be split into two subtasks: computing 
the most specific concept of a single individual, and computing the least 
common subsumer of a given finite number of concepts. In fact, it is easy to 
see that msc(a\, . . . , a n ) = lcs(msc(ai), . . . , msc(a n )). 

6.3.2. 1 Computing the lcs and the msc 

We will now give an intuition on how to compute the lcs for the Description 
Logic T C,q and an extension, and briefly comment on the problems that 
arise when considering the msc. As mentioned above, the first step towards 
an algorithm for computing the lcs is to characterize subsumption of con- 
cept descriptions. For the Description Logic J-jCq, we will present such a 
characterization using the language-based approach. 

The normal form of .T-Xo-concept descriptions employed in the language- 
based approach is the so-called concept- centered normal form (CCNF), 
which has already been introduced in Chapter 2, Subsection 2. 3. 3. 2. For 
example, using the equivalence \/R.(C fl D) = MR.C n MR.D and commuta- 
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tivity of concept conjunction, the .^.Co-concept description C = VR.(\/S.A n 
MR.B ) fl VS. VS. A can be transformed into CCNF as follows: 


C = VR.VS.A n VS.VS'.A n VR.VR.B 
= V{RS, SS}. A n V{RR}.B. 


Recall that \/{RS, SS}.A has been introduced in Chapter 2, Subsec- 
tion 2. 3. 3. 2 as an abbreviation for VR.VS.A n VS.VS.A. Similarly, \/{RR}.B 
abbreviates VR.VR.B. 

In general, if Nq is a finite set of atomic concepts and Nr is a finite set 
of role names, then the CCNF of a concept C built using only these names 
is of the form 


C = 


n 

A&N C 


VU a .A , 


where Ua is a finite set of words over the alphabet of role names, i.e. , Ua Q 
Nft. Note that V0.A represents the universal concept T, and Vjej.A for the 
empty word e is equivalent to A. 

If the CCNF of D is I \agn c VVa-A then subsumption of C by D can be 
characterized as follows: 


Proposition 6.15 C C D iff Va C Ua for all A £ Nq- 

As an easy consequence, we obtain 
Corollary 6.16 lcs(C,D ) = I \agn c V(17a Fl Va)-A. 

By Proposition 6.15, this concept description obviously subsumes C and D. 
Moreover, Ua H Va is the largest set contained in both Ua and Va, and thus 
i UeiVc V(£/a FI Va)-A is in fact the least concept subsuming both C and D. 

As an example consider the concept C specified above and D = 
V{RS, RR}.A n V{RR, SR}.B. Then, lcs(C, D) = V{RS}.A n V{RR}.B. 

For Description Logics extending RLq by constructs that can express 
unsatisfiable concepts, like _L, the language-based approach can still be 
applied. However, in order to characterize subsumption, we need to con- 
sider certain infinite regular languages instead of finite ones. The rea- 
son is that _L is subsumed by an infinite number of concept descrip- 
tions. For example, although \/{R, RSR}.A- C V{RR}.1, we do not have 
V± = {RR} C {R,RSR} =: U±. However, we know that V{R}.T is sub- 
sumed by \/{Rw}.A- for any word w of the alphabet Nr. Consequently, we 
must use U^-N^ = {vw \ v £ U± and w £ A^} in place of U± in the inclu- 
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sion test. For this reason, the lcs must also be described in terms of possibly 
infinite regular languages. As a simple example, consider the concept de- 
scriptions C = V{R, SR }.- L and D = V{RS, 5}._L. Then, 

lcs{C, D) = V({i? , SR}-N^ n {RS, S}-N* R ).± 

= \/({RS,SR}-N* r ).±_ 

= V{RS, SR}.±. 

A detailed description of how to compute the lcs in ACJV , which ex- 
tends T Cq by _L, atomic complement, and number restrictions, is given 
in [Baader and Kiisters, 1998]. Moreover, Baader and Kiisters investigate 
cyclic AAAAconcept descriptions, which are defined in terms of cyclic ter- 
minologies with greatest fixpoint semantics. In this context, the languages 
U a introduced above can be arbitrary regular languages (see also Chapter 2, 
Subsection 2. 3. 3. 2). 

Cyclic descriptions become necessary if one wants to guarantee the ex- 
istence of the msc. Consider, for example, the ABox consisting only of the 
assertion R(a,a). Then, we know that msc(a) C VR. ■ ■ ■ VR.(^ 1 R) for ar- 
bitrarily deep nesting of value restrictions. Baader and Kiisters show that 
there does not exist an acyclic ACAAconcept description presenting the msc 
of a. However, the msc of individuals described in AAAAABoxes can always 
be represented by a cyclic AAAAconcept description. In our example, msc(a) 
can be represented by the concept A defined by A = (= 1 R) n Vi?. A, if this 
definition is interpreted with greatest fixpoint semantics. 

Using the graph-based approach, the lcs can be computed for the 
Description Logic that extends FCq by the same-as construct [Cohen and 
Hirsh, 1994a; Frazier and Pitt, 1994; Kiisters and Borgida, 2001], for the 
language ACC, which extends CCq by full existential quantification and 
primitive negation [Baader et ai, 1999b], and for the language ACSM, 
which extends ACS by number restrictions [Kiisters and Molitor, 2001b]. 
On the one hand, it is not clear how to handle these languages with the 
language-based approach. On the other hand, up to now the graph-based 
approach cannot deal with cyclic concept descriptions, which are needed 
for computing the msc. Consequently, for the extensions of TC$ treated 
with the help of the graph-based approach, the msc can currently only be 
approximated [Cohen and Hirsh, 1994b; Kiisters and Molitor, 2001a]. 


6.3.3 Unification and matching 

Unification and matching are non-standard inferences that allow us to 
replace certain concept names by concept descriptions before testing for 
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equivalence or subsumption. This capability turns out to be useful when 
maintaining (large) knowledge bases. In this subsection, we will first intro- 
duce unification and matching and mention the main motivations for con- 
sidering these new inference tasks. We will then review the results available 
in the literature, and give an intuition on how unification problems in the 
small language TCq can be solved. 

6.3.3. 1 Unification 

Unification of concepts was first introduced by Baader and Narendran [1998] , 
motivated by the following application problem. If several knowledge en- 
gineers are involved in defining new concepts, and if this knowledge ac- 
quisition process takes rather long (several years), it happens that the 
same (intuitive) concept is introduced several times, often with slightly 
differing descriptions. Testing for equivalence of concepts is not always 
sufficient to find out whether, for a given concept description, there 
already exists another concept description in the knowledge base de- 
scribing the same notion. As an example, let us ask whether the fol- 
lowing two .TTCo-concept descriptions might denote the same (intuitive) 
concept. 

Vhas-child.Vhas-child.Rich n Vhas-child.Rmr, 

Acr n Vhas-child.Acr n Vhas-child.Vhas-spouse.Rich. 

The answer is yes, since replacing the concept name Rmr by the description 
Rich n Vhas-spouse.Rich and Acr by Vhas-child.Rich yields the descriptions 

Vhas-child.Vhas-child.Rich n Vhas-child.(Rich n Vhas-spouse.Rich), 

Vhas-child.Rich n Vhas-child.Vhas-child.Rich n Vhas-child.Vhas-spouse.Rich, 

which are obviously equivalent. Thus, under the assumption that Rmr stands 
for “Rich and married rich” and Acr for “All children are rich” , we can con- 
clude that both descriptions are meant to express the concept “All grand- 
children are rich and all children are rich and married rich” . 

A substitution of concept descriptions for concept names that makes two 
concept descriptions C, D equivalent is called a unifier of C and D. Of course, 
before testing for unifiability, one must decide which of the concept names 
the unifier is allowed to replace. These names are then called concept vari- 
ables to distinguish them from the usual concept names, which cannot be 
replaced. In the above example, the strange acronyms Acr and Rmr were 
considered to be variables, whereas Rich was treated as a (non-replaceable) 
concept name. Concept descriptions containing variables are called concept 
patterns. More precisely, T 'Co-concept patterns are defined by means of the 
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following syntax rules: 

C,D — ► X | A | VR.C \CnD 

where X stands for concept variables. 

Now, a substitution in EC-o is a mapping from the concept variables into 
the set of .TTCo-concept descriptions. An example is the substitution {Rmr i— > 
Rich n Vhas-spouse. Rich, Acr i— > Vhas-chi Id .Rich} used in our example. The 
application of a substitution can be extended from variables to .TTCo-concept 
patterns in the usual way (as exemplified above). 

Definition 6.17 Let C,D be T'To-concept patterns. Then, a substitution 
cr is a unifier of the unification problem C = ? D iff a(C) = <r(D). 

Of course, it is not necessarily the case that concept descriptions that are 
unifiable in this way are really meant to represent the same notion. A unifi- 
ability test can, however, suggest to the knowledge engineer possible candi- 
date descriptions. 

6. 3. 3. 2 Matching 

Matching can be seen as a special case of unification, where one of the two 
expressions to be unified does not contain variables [Baader and Narendran, 
1998; 2001], Thus, a matching problem is of the form C = ? D where C is a 
concept description and D a concept pattern. A substitution a is a matcher 
of this problem iff C = cr(D). 

Borgida and McGuinness [1996] have introduced a different notion of 
matching, which we call matching modulo subsumption to distinguish it 
from matching modulo equivalence , as introduced above. A matching prob- 
lem modulo subsumption is of the form C C ? D, where C is a concept de- 
scription and D is a concept pattern. Such a problem asks for a substitution 
a such that C C cr(D). 

Since a is a solution of C C ? D iff a solves C C n D, matching modulo 
subsumption can be reduced to matching modulo equivalence, and thus to 
unification. However, in the context of matching modulo subsumption, one 
is interested in finding “minimal” solutions of C C ? D, i.e., a should satisfy 
the property that there does not exist another substitution <5 such that C C 
5(D) c &(D). In addition, Baader et al. [1999a] introduce side conditions of 
the form ICB and X n E, with X a variable and E a concept pattern, 
to further restrict possible substitutions for the variables occurring in the 
matching problem. 

The original reason for introducing matching modulo equivalence was 
(i) to help filter out unimportant aspects of complicated concepts appearing 
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in large knowledge bases, and (ii) to specify patterns for explaining proofs 
carried out by Description Logic systems [McGuinness and Borgida, 1995]. 
For example, matching the concept pattern 

D = Vresearch-interests.A 


against the description 

C = Vpets.Cat n Vresearch-interests.AI n Vhobbies. Gardening 

yields the minimal matcher a = {X i— »• Al } , and thus finds the scientific in- 
terest described in the concept, filtering out the other aspects described by 
C. 

Another motivation for matching as well as unification can be found in 
the area of integrating data or knowledge base schemas represented in some 
Description Logic. An integrated schema can be viewed as the union of 
the local schemas along with some interschema assertions satisfying certain 
conditions. Finding such interschema assertions can be supported by solving 
matching or unification problems. Borgida and Kiisters [2000] propose a 
formal framework for schema integration, and provide initial theoretical as 
well as experimental results concerning this application of unification and 
matching. 


6. 3. 3. 3 Results on matching and unification 
As with computing the lcs, the algorithms for matching that can be found in 
the literature follow either the language-based or the graph-based approach. 
Matching modulo subsumption for a description language containing most 
of the constructs available in Classic has been considered in [Borgida and 
McGuinness, 1996]. Borgida and McGuinness describe a polynomial-time 
matching algorithm, which follows the graph-based approach. However, this 
algorithm cannot be applied to arbitrary patterns, and it is not complete. Us- 
ing the language-based approach, complete and polynomial-time algorithms 
for matching modulo equivalence and matching modulo subsumption in J-Cq 
were presented in [Baader and Narendran, 1998; 2001]. This result was ex- 
tended to the language ACM by Baader et al. [1999a] and its extension 
ALMxeg by the role constructors union, composition, and transitive closure 
by Kiisters [2001]. Baader et al. [2001] consider matching under side condi- 
tions in more detail. Basically, subsumption conditions of the form ICB 
leave the complexity of matching in ACM polynomial, whereas strict sub- 
sumption conditions X \Z E cause NP-hardness. Matching in ACS based on 
the characterization of subsumption by homomorphism between graphs has 
been investigated in [Baader and Kiisters, 2000]. It is shown that matching 
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modulo equivalence is NP-complete, and that appropriate matchers can be 
computed in exponential time. Finally, complete algorithms for matching in 
Classic are provided by Kiisters [2001]. 

For unification, the only results available to date are for the small De- 
scription Logic TLq and its extension XXreg by the role constructors 
union, composition, and transitive closure. In [Baader and Narendran, 1998; 
2001] it is shown that deciding unifiability of XXo-patterns is an ExpTime- 
complete problem, and in [Baader and Kiisters, 2001] this result is extended 
to X£reg- In the remainder of this subsection, we will try to give a flavor of 
how to solve unification problems in FCq. 

As an immediate consequence of Proposition 6.15, equivalence of FCq- 
concept descriptions C = r~l agn c VC/a-A and D = I \agn c VVa-A in CCNF 
can be characterized as follows: 

C = D iff U A = V A for all A € N c . (6.3) 

This fact can be used to turn XXo- un ification problems into certain formal 
language equations, which then can be solved using tree automata. 

Let us illustrate this by the example from Subsection 6.3.3. 1. There, we 
considered the unification problem 8 

\/{cc}.R n V{c}.A V{e, c}.Y n \/{cs}.R. 

As an easy consequence of (6.3), a substitution a of the form 

{X^\/U x -R, Y^VUy.R}, 

where Ux, Uy are sets of words over the alphabet {c, s}, is a unifier of this 
problem iff the assignment X = Ux and Y = Uy solves the formal language 
equation 

{cc} U {c}-X = {cs} U {e, c}-Y. 

For example, the unifier {X e- > R n Ws.R, Y e- > \/c.R} corresponds to the 
solution X = {e, s}, Y = {c} of the above formal language equation. In gen- 
eral, unification problems correspond to systems of formal language equa- 
tions of the form 

S 0 U Si-Ai U ■ ■ • U S n -X n = T 0 U TyXi U • • • U T n -X n , 

where the Si , T are given finite sets of words and the A, are vari- 
ables ranging over finite sets of words. In [Baader and Narendran, 1998; 


8 To increase readability, has-spouse is replaced by s, has-child by c, Rich by R, and Rmr. Acr by 
the variables X, Y. In addition, we have already transformed the patterns into their CCNF. 
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2001] it is shown that solvability of such a system of equations can be re- 
duced (in exponential time) to the emptiness problem for automata on finite 
trees. This yields an ExpTiME-decision procedure for unification in J-jCq- 
For unification in .££reg, the S t , Ti are regular languages, and to test the 
equation for solvability one must employ automata working on infinite trees. 


6.3.4 Concept rewriting 

A general framework for rewriting concepts using terminologies has been 
proposed in Baader et al. [2000]. Assume that C\,C-2, and £3 are three 
description languages, and let C be an £i-concept description and T an 
£2-TBox. We are interested in rewriting (i.e., transforming) C into an /23- 
concept description D such that C and D are in a certain relationship (e.g., 
equivalence, subsumption w.r.t. T) and such that D satisfies certain opti- 
mality criteria (e.g., being of minimal size). 

This very general framework has several interesting instances. In the fol- 
lowing, we will discuss the three most promising ones. 

The first instance is the translation of concept descriptions from one De- 
scription Logic into another. Here, we assume that C\ and £3 are different 
description languages, and that the TBox T is empty. By trying to rewrite 
an £i-concept C into an equivalent £3-concept D, one can find out whether 
C is expressible in £3. In many cases, such an exact rewriting may not exist. 
In this case, one can try to approximate C by an £3-concept from above (be- 
low), i.e., find a minimal (maximal) concept description D in £3 such CCD 
(D C C). An inference service that can compute such rewritings could, for 
example, support the transfer of knowledge bases between different systems. 
First results in this direction for the case where £1 is ACC and £3 is ACS 
can be found in [Brandt et al., 2001]. 

The second instance comes from the database area, where the problem 
of rewriting queries using views is a well-known research topic [Beeri et al., 
1997]. The aim is to optimize the runtime of queries by using cached views, 
which allows one to minimize the (more expensive) access to source relations. 
In the context of the above framework, views can be regarded as TBox 
definitions and queries as concept descriptions. Beeri et al. [1997] investigate 
the instance where £1 = £2 = ACCN 1 Z and £3 = {n,U}. More precisely, 
they are interested in maximally contained total rewritings, i.e., D should 
be subsumed by C, contain only concept names defined in the TBox, and be 
a maximal concept (w.r.t. subsumption) satisfying these properties. They 
show that such a rewriting is computable (whenever it exists). 
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The third instance of the general framework, which was first proposed 
in [Baader and Molitor, 1999], tries to increase the readability of large 
concept descriptions by using concepts defined in a TBox. The motiva- 
tion comes from the experiences made with non-standard inferences (like 
lcs, msc and matching) in applications. The concept descriptions produced 
by these services are usually unfolded (i.e., do not use defined names), 
and are thus often very large and hard to read and comprehend. There- 
fore, one is interested in automatically generating an equivalent concept 
description of minimal length that employs the concept names defined in 
the underlying terminology. Referring to the framework, one thus consid- 
ers the case where C, = C\ = £2 = £3 and the TBox is nonempty. For a 
given concept description C and a TBox T in C one is interested in an £- 
concept description D (containing concept names defined in T) such that 
C =q- D and the size of D is minimal. Rewriting in this sense has been 
investigated for the languages ALM and ACS [Baader and Molitor, 1999; 
Baader et al, 2000]. Rewritings can be computed by a nondeterministic 
polynomial algorithm that uses an oracle for deciding subsumption. The 
corresponding decision problem (i.e., the question whether there exists a 
rewriting of size < k for a given number k) is NP-hard for both languages. 
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Abstract 

A DL-based knowledge representation system is more than an inference en- 
gine for a particular Description Logic. A knowledge representation system 
must provide a number of services to human users, including presentation of 
the information stored in the system in a manner palatable to users and jus- 
tification of the inferences performed by the system. If human users cannot 
understand what the system is doing, then the development of knowledge 
bases is made much more difficult or even impossible. A knowledge rep- 
resentation system must also provide a number of services to application 
programs, including access to the basic information stored in the system 
but also including access to the machinations of the system. If programs 
cannot easily access and manipulate the information stored in the system, 
then the development of applications is made much more difficult or even 
impossible. 


7.1 Introduction 

A DL-based knowledge representation system does not live in a vacuum. 
It has to be prepared to interact with several sorts of other entities. One 
class of entities consists of human users who develop knowledge bases using 
the system. If the system cannot effectively interact with these users then 
it will be difficult to create knowledge bases in the system, and the system 
will not be used. Another class of entities consists of programs that use 
the services of the system to provide information to support applications. 
If the system cannot effectively interact with these programs then it will be 
difficult to create applications using the system, and the system will not be 
used. 
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However, before one can talk about effective interaction, there has to be 
basic interaction between the knowledge representation system and applica- 
tions or users. This basic interaction has to do with the mechanics of telling 
information to the system and retrieving information from it. At this level 
the system just maintains what it was told and responds to the queries by 
running an inference procedure for the logic it implements. 

The basic interface is not sufficient for effective access to the system. 
On the application side there is need for a treatment of exceptional con- 
ditions, wider interface to applications, remote interfaces, and concurrent 
access, among others. There is also need for responsive reaction by the 
system. On the human side there is need for better presentation of the 
results of queries, particularly the suppression of irrelevant detail; explana- 
tion of the inferences performed by the system; better support for the cre- 
ation of large DL knowledge bases, particularly by several people working in 
collaboration. 

Even if all the above are present in a system, it will still not be complete. 
There is also a need to have effective information about the system widely 
available. This information has to be in various forms, including the obvious 
user manuals, but also including interactive tutorials and demonstration 
systems. 

A system that does not include all of the above services is not a complete 
knowledge representation system. 

Our discussion of the services that need to be provided will mostly 
be described in terms of an arbitrary DL knowledge representa- 
tion system. However, some of our examples will be given in the 
context of the Classic family of knowledge representation systems 
developed at AT&T [Borgida et al. , 1989; Brachman et al . , 1991; 
Patel-Schneider et al . , 1991], as Classic has had the longest-lived and 
most extensive industrial application history of any DL knowledge rep- 
resentation system. The Classic application that we will refer to the 
most is the configuration of transmissions equipment - an application 
developed within AT&T [Wright et al, 1993; McGuinness et al, 1995; 
McGuinness and Wright, 1998b; McGuinness et al . , 1998]. 

In a typical configuration problem, a user is interested in entering a small 
number of constraints and obtaining a complete, correct, and consistent 
parts list. Given a configuration application’s domain knowledge and the 
base DL inference system, the application can determine if the user’s con- 
straints are consistent. It can then calculate the deductive closure of the 
user-stated knowledge and the background domain knowledge to generate 
a more complete description of the final parts list. For example, in a home 
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theater demonstration configuration system [McGuinness et al . , 1995], user 
input is solicited on the quality a user is willing to pay for and the typical use 
(audio only, home theater only, or combination), and then the application 
deduces all applicable consequences. This typically generates descriptions for 
6-20 subcomponents which restrict properties such as price range, television 
diagonal, power rating, etc. A user might then inspect any of the individual 
components possibly adding further requirements to it which may, in turn, 
cause further constraints to appear on other components of the system. Also, 
a user may ask the system to “complete” the configuration task, completely 
specifying each component so that a parts list is generated and an order 
may be completed. 

This home theater configurator example is fairly simple but it is mo- 
tivated by real-world application uses in configuring very large pieces of 
transmission equipment where objects may have thousands of parts and 
subparts and one decision can easily have hundreds of ramifications. It was 
complicated applications such as these that drove our work on access to 
information. More information can be found on Description Logics for con- 
figuration in this book in Chapter 12. Another example application that 
drove our work on information access and presentation needs was a sim- 
ple DL backend system supporting knowledge-enhanced search for the web 
called FindUR [McGuinness, 1998; McGuinness et al . , 1997] which is also 
described in Chapter 14. 


7.2 Basic access 

Basic access to a DL knowledge base consists of simple mechanisms to cre- 
ate DL knowledge bases and to query them. The foundational aspects of 
this basic interaction have been well-studied. For example, Levesque [1984] 
proposed that the basic interface to any knowledge representation system 
consist of two kinds of interactions - one to tell information to the system 
and one to ask whether information follows from what was previously told 
to the system. 

Many frame-oriented knowledge representation systems embody such dis- 
tinctions, such as the Generic Frame Protocol [Chaudhri et al, 1997], and 
OKBC (Open Knowledge Base Connectivity) [Chaudhri et al., 1998a]. In 
the DL community, this basic interaction was standardized into an in- 
terface specification that defined a number of Tell&Ask operations that 
a DL knowledge representation system should implement [Patel-Schneider 
and Swartout, 1993]. This specification is commonly known as the Krss 
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Table 7.1. Syntax and semantics of making definitions. 


Program Syntax 

Abstract Syntax 

Semantics 

(def ine-concept CN C) 

CN = C 

CN 1 = C 1 

(def ine-primitive-concept CN C) 

CN QC 

CN 1 C C 1 

(define-role RN R) 

RN = R 

RN 1 = R 1 

(def ine-primitive-role RN R) 

RN C R 

RN 1 C R 1 

(def ine-attribute AN A) 

AN = A 

AN 1 = A 1 

(def ine-primitive-attribute AN R) 

AN C R 

AN 1 C R 1 


specification. 1 The description of a minimal DL knowledge representation 
system interface given here will generally follow this Krss specification. The 
Krss specification incorporates the DFKI standardized syntax and seman- 
tics [Baader et al., 1991]. Examples given here follow the syntax of Chapter 2, 
for the abstract syntax, and the syntax of Krss for a LiSP-like syntax that 
can actually be used from within a computer. 

One problem with defining a Tell&Ask interface for a DL knowledge repre- 
sentation system is that even a minimal interface depends on the expressive 
power of the logic. As an example, if the Description Logic implemented 
by the system does not include individuals then of course there is no need 
to include any facilities for making statements about individuals. To over- 
come this difficulty this chapter will describe the interfaces required for a 
system that implements a typical Description Logic with both concepts and 
individuals. 

Such a system has to have a method for creating a terminology of concepts. 
A syntax for creating such a terminology, taken directly from the Krss 
specification, is given in Table 7.1. A terminological knowledge base, or 
TBox, is then a set of such definitions perhaps with the condition that every 
concept, role, and attribute name has at most one definition. There may 
also be the side condition that there are no recursive definitions. 

Some representation systems may have other definitions allowable or other 
restrictions. For example, some systems allow the definition of transitive 
roles, via a define-transitive-role definition. Other systems prohibit non- 
primitive roles. 

If the underlying Description Logic allows recursive definitions, then it 
may be easier to provide an even more basic interface to define concepts. 
Table 7.2 shows a minimal interface for a system that employs arbitrary 
concept inclusions as its means of defining concepts. 

1 The Krss specification also incorporates a number of operations that fall under the advanced 

interface that will be discussed later. 
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Table 7.2. Inclusion syntax and semantics. 


Program Syntax 

Abstract Syntax 

Semantics 

(included C D) 

CQD 

C x C D 1 

Table 7.3. Assertion syntax and semantics. 

Program Syntax 

Abstract Syntax 

Semantics 

(instance IN C) 

IN eC 

IN 1 G C x 

(related IN I R) 

(IN, I) G R 

(IN 1 , I x ) G R x 


Table 7.4. Query syntax and semantics. 


Query 

Meaning 

(concept-subsumes? Cl C2) 

c x cc x 

(role-subsumes? R1 R2) 

ftp c R x 

(individual-instance? IN C) 

IN 1 G C x 

(individual-related? IN I R) 

(IN 1 , I x ) G R x 


If the system incorporates individual reasoning, then it has to have a 
mechanism for adding information about these individuals. One such method 
is via the assertions in Table 7.3. An assertional knowledge base, or ABox, 
is then a set of such assertions. 

Once information has been told to the system, there has to be a mechanism 
for determining what follows from this information. A minimal mechanism 
for this is via a set of queries, such as those given in Table 7.4. The system 
answers a quary by determining if the meaning of the query is implied by 
the information that has been told to the system. 

The interface described above is sufficient for determining the contents 
of a knowledge base but only in the theoretical sense. For reasonable ac- 
cess to the information in a knowledge base a richer interface is required. 
One part of this richer access even really belongs in the basic interface, 
namely retrievals of taxonomy information. The interface in Table 7.5 pro- 
vides a simple interface to the taxonomy information implicit in a DL knowl- 
edge base. The meaning of the calls should be obvious from their descrip- 
tion, except perhaps the “-direct-” versions, which return the concepts, 
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Table 7.5. Taxonomy retrieval 
syntax. 


(concept-descendants C) 
(concept-children C) 
(concept-ancestors C) 
(concept -parents C) 
(concept-instances C) 
(concept-direct-instances C) 
(role-descendants R) 
(role-children R) 
(role-ancestors R) 
(role-parents R) 
(individual-types IN) 
(individual-direct-types IN) 
(individual-fillers IN R) 


individuals, or roles that are directly related to the query, i.e., that have no 
intervening concept or role. 

Another basic service that is missing from above interface is the ability 
to remove information from the knowledge base. This is not the ability to 
perform arbitrary changes to the implicit information represented by the 
knowledge base. Instead it is just the ability to “un-tell” information that 
had been previously told to the system. A basic interface for this purpose is 
given in Table 7.6. There may be restrictions on what can be un-told, such 
as requiring that concepts that are currently mentioned in the definition of 
other concepts cannot be removed from the knowledge base. 


7.3 Advanced application access 

The basic interface described above provides only minimal access to a DL 
knowledge base. Effective access requires a number of augmentations to the 
basic interface. 

One of the most important augmentations has to do with defining a com- 
plete application programming interface (API). The basic interface assumes 
that the system is implemented in a language like Lisp, where there is a 
simple way of creating descriptions and other values for the various opera- 
tions and there is a mechanism for returning values of any type. This was 
acceptable when systems and applications were all implemented in Lisp, but 
this is no longer the case. 

A complete API must then provide a syntax for creating all the types 
of values that need to be passed to the representation system. Further, it 
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Table 7.6. UnTell syntax. 


(undef ine-concept CN) 
(undef ine-role RN) 

(undef ine-attribute AN) 
(un-tell-instance IN C) 
(un-tell-related IN I R) 


needs to provide or define mechanisms for returning values, particularly 
compound values such as the sets of concepts that are returned by the 
taxonomic retrieval operations. 


7.3.1 Efficiency 

Because the operations of the representation system may represent the 
largest resource consumption of an application, it is often necessary to know 
how expensive various operations of the system may be. For example, it is 
often necessary to know the usual resource consumption of the most fre- 
quently called operations of the knowledge representation system or those 
operations that are called at critical time in the operation of the whole 
system. 

The Classic family has been particularly aggressive in ensuring that 
queries to the system are fast, working under the assumption that the most 
common operations are queries. Most queries in Classic are simply re- 
trievals of data stored by the system, as Classic responds to the addition 
of knowledge by computing most of its consequences. Further, the perfor- 
mance of the addition of knowledge to the system is optimized over the 
retraction or change of knowledge. 

Classic achieves these characteristics of fastest queries, fast additions, 
and slower retractions and changes by retaining data structures that record 
the current set of consequences and also record, on a fairly granular level, 
which knowledge affects other knowledge. This is not full truth-maintenance 
data, which would be prohibitively expensive to compute (and store) , but is 
just enough to make additions cheap. It also serves to make retractions and 
changes somewhat cheaper than they otherwise would be, but this effect is 
much less than the gain in speed of adding knowledge. 
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7.3.2 Wide application programming interface 

In the vast majority of applications, the knowledge representation system 
has to serve as a tightly integrated component of a much larger overall 
system. For this to be workable, the knowledge representation system must 
provide a full-featured interface for the use of the rest of the system. 

The NeoClassic system, which is programmed in C++, and is designed to 
be part of a larger C++ program, provides a very wide application program- 
ming interface. In addition to the above interface, there is a large interface 
that lets the rest of the system receive and process the actual data struc- 
tures used inside NeoClassic to represent knowledge, but without allowing 
these structures to be modified outside of NeoClassic . 2 This interface al- 
lows much faster access to the knowledge stored by NeoClassic, as many 
accesses just retrieve fields from a data structure. Further, direct access to 
data structures allows the rest of the system to keep track of knowledge from 
NeoClassic without having to keep track of a “name” for the knowledge 
querying using this name. (In fact, it is in this way possible to dispense with 
any notion of querying by name.) 

There are also ways to obtain the data structures that are used by Neo- 
Classic for other purposes, including explanation. We have used this fa- 
cility to write graphical user interfaces to present explanations and other 
information. 

An additional interface that is provided by both Lisp Classic and Neo- 
Classic is a notification mechanism, or hooks. This mechanism allows pro- 
grammers to write functions that are called when particular changes are 
made in the knowledge stored in the system or when the system infers new 
knowledge from other knowledge. Hooks for the retraction of knowledge from 
the system are also provided. These hooks allow, among other things, the 
creation of a graphical user interface that mirrors (some portion or view of) 
the knowledge stored in the representation system. 

Others in the knowledge representation community have recognized the 
need for common APIs, (e.g., the Generic Frame Protocol [Chaudhri et ai, 
1997] and the Open Knowledge Base Connectivity [Chaudhri et al . , 1998a]). 
Some systems embrace the notion of loading many different forms of knowl- 
edge bases and accept wrapper specifications for other source formats and 
APIs. For example, Ontolingua has implemented capability for load- 
ing a number of formats including Classic, OKBC, ANSI KIF, KIF 3.0, 
CML, CLIPS, Ontolingua, Protege, Snark, and DAML+OIL. It also 
provides the ability to dump frames in multiple formats such as OKBC, 

2 Of course, as C++ does not have an inviolable type system, there are mechanisms to modify 

these structures. It is just that any well- typed access cannot. 
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Classic, CLOS, CML, Ontolingua, and DAML+OIL and it has also 
been made interoperable with at least two reasoners including one in Lisp 
and one in Java. 


7.3.3 Remote and concurrent access 

The standard computing environment is becoming more and more dis- 
tributed. If a DL knowledge representation system is to be part of this 
environment it must allow effective remote access. There are several mech- 
anisms for allowing remote access, including applications that run on the 
same machine as the DL knowledge representation system but themselves 
provide a remote access mechanism. Examples of such applications are the 
wines [Brachman et al, 1991] and stereo configuration demonstration sys- 
tems [McGuinness et al., 1995] mentioned later in this chapter. 

The DL knowledge representation system itself can also directly provide 
a remote access mechanism. This can be as simple as providing the system 
with a pipe-like interface where clients can send a sequence of commands to 
the system from remote machines, and receive responses via the same pipe. 
NeoClassic provides this sort of simple remote access mechanism. 

A more complicated remote access mechanism would be to provide a 
CORBA interface to the system. This kind of access was proposed by 
Bechhofer et al. [1999]. Their interface gives a CORBA layering around 
a Tell&Ask interface. Providing a wider CORBA access to DL knowledge 
representation systems, such as providing CORBA access to the actual data 
structures of the system, is more difficult, as the CORBA mechanism for 
dealing with recursive objects is annoying. Nevertheless, an effective remote 
access mechanism should provide the same functionality as is desired for 
local access. 

If remote access to a DL knowledge representation system is provided, 
then the issue of concurrent access becomes vital. (This is not to say that 
concurrent access is not of interest if the system does not allow remote ac- 
cess.) The interesting issues with respect to concurrent access involve simul- 
taneous access to the same repository of knowledge. Most of the issues with 
respect to concurrent access are the same as concurrent access to databases, 
including locking and providing transactions. In fact, there have been infor- 
mal proposals to use a database system to store the information in a DL 
knowledge representation system like Classic just so as to piggyback on 
the facilities for concurrent access provided by the database system. 

The remote interface proposal mentioned above provides a limited form of 
transactions, basically allowing clients to batch up a collection of updates to 
a knowledge base and apply them all at once as an atomic transaction. This 
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interface, however, does not provide any mechanism to abort transactions 
or to provide a local view of the knowledge base during the execution of a 
transaction. 

At least one other knowledge representation system has dealt with the no- 
tion of concurrent access by leveraging the notion of sessions. Ontolingua 
allows users to log into a particular session that may already be opened by 
a previous user. All users logged into the same session see the same version 
of the knowledge base. A more sophisticated approach to concurrent access 
and knowledge base editing is embodied in OntoBuilder [Das et al, 2001]. 
In this system, users can not only do something similar to sharing a session, 
but the implementation also facilitates collaboration through dialog with 
other users currently signed on to the same ontology and allows locking of 
concepts for updates. 


7.3.4 Platforms 

Another important access aspect concerns the platforms on which the knowl- 
edge representation system runs. This encompasses not only the machines 
and operating systems, but also the language in which the system is written 
(if it is visible), the version of the libraries that the system uses, and the 
mechanism for linking to the system. Many applications have needs for a 
particular operating system or language, and cannot utilize tools not avail- 
able in this context. 

Some Description Logics like Classic have been made available on a rea- 
sonable number of platforms. The underlying language of a member of the 
Classic family is visible, not just because of the API which is, of neces- 
sity, language-specific, but also because programmers can write functions to 
extend the expressive power of the system, and these functions have to be 
written in the underlying language of the system. 

Classic is currently available in two different languages: Lisp and C++. 
The C++ member is the more recent, and the reimplementation used C++ 
precisely to make Classic available for a larger number of applications. 
This was done even though C++ is not the ideal language in which to write 
a representation system. 

The members of the Classic family have also been written in a platform- 
independent manner. This has required not using some of the nicer capa- 
bilities of the underlying language or of particular operating systems. For 
example, NeoClassic does not use C++ exceptions, partly because few C++ 
compilers supported this extension to the language. Lisp-Classic runs on 
various Lisp implementations and on various operating systems, including 
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most versions of Unix, MacOS, and Windows. NeoClassic runs under four 
C++ compilers and on both Unix and Windows NT. 

With the influence of the web and more distributed development envi- 
ronments, it may be expected that more Description Logics may be made 
available on multiple platforms and may be integrated into more hybrid en- 
vironments. One example of another knowledge representation system that 
found a need to do this is the Chimaera Ontology Evolution Environment 
[McGuinness et al, 2000b]. This system has been connected to Ontolin- 
GUA for ontology editing and simple inference, a LiSP-based reasoner for 
some diagnostics, and a hybrid Java-based reasoning environment that sup- 
ports both first-order logic reasoning as well as special-purpose reasoning 
for the DAML+OIL Description Logic. 


7.4 Advanced human access 
7.4-1 Explanation 

Many research areas which focus on deductive systems (such as expert sys- 
tems and theorem proving) have determined that explanation modules are 
required for even simple deductive systems to be usable by people other 
than their designers. Description Logics have at least as great a need for 
explanation as other deductive systems since they typically provide simi- 
lar inferences to those found in other fields and also support added infer- 
ences particular to Description Logics. They provide a wide array of infer- 
ences [Borgida, 1992b] which can be strung together to provide complicated 
chains of inferences. Thus conclusions may be puzzling even to experts in 
Description Logics when application domains are unfamiliar or when chains 
of inference are long. Additionally, naive users may require explanations for 
deductions which may appear simple to knowledgeable users. Both sets of 
needs became evident in work on a family of configuration applications and 
necessitated an automatic explanation facility. 

The main inference in Description Logics is subsumption - determining 
when membership in one class necessitates membership in another class. For 
example, Person is subsumed by Mammal since anything that is a member of 
the class Person must be a member of the class Mammal. Almost every infer- 
ence in Description Logics can be rewritten using subsumption relationships 
and thus subsumption explanation forms the foundation of an explanation 
module [McGuinness and Borgida, 1995]. 

Although subsumption in most implemented Description Logics is cal- 
culated procedurally, it is preferable to provide a declarative presentation 
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of the deductions because a procedural trace typically is very long and is 
littered with details of the implementation. A declarative explanation mech- 
anism which relies on a proof-theoretic representation of deductions may be 
used as a framework. Such a mechanism has been specified [McGuinness, 
1996] and implemented for Classic and later specified for ALM [Baader 
et al, 1999a]. 

All the inferences in a DL system can be represented declaratively by proof 
rules which state some (optional) antecedent conditions and deduce some 
consequent relationship. The subsumption rules may be written so that they 
have a single subsumption relationship in the denominator. For example, if 
Person is subsumed by Mammal, then it follows that something that has 
all of its children restricted to be Persons must be subsumed by something 
that has all of its children restricted to be Mammals. This can be written 
more generally (with C representing Person, D representing Mammal, and 
R representing child) as the V restriction rule below: 


All restriction 


b CCD 
P VR.C E VR-D ' 


Using a set of proof rules that represent DL inferences, it is possible 
to give a declarative explanation of subsumption conclusions in terms of 
proof rule applications and appropriate antecedent conditions. This basic 
foundation can be applied to all of the inferences in Description Logics, 
including all of the inferences for handling constraint propagation and other 
individual inferences. There is a wealth of techniques that one can employ 
to make this basic approach more manageable and meaningful for users 
[McGuinness and Borgida, 1995; McGuinness, 1996]. 

Expressive DL-based systems may require a large number of proof rules. If 
one is interested in limiting both explanation implementation work and also 
the size of explanations, it is beneficial to prune the number of inferences 
to be explained. In one configuration family of applications [McGuinness 
and Wright, 1998b] the help desk logs were analyzed to determine the most 
frequent questions that related to explanation. These inferences included 
inheritance (if A is an instance of B and I? is a subclass of C. then A “in- 
herits” all the properties of C), propagation (if A fills a role R on B, and 
B is an instance of something which is known to restrict all of its fillers 
for the R role to be instances of D, then A is an instance of D), rule fir- 
ing (if a is an instance of E and E has a rule associated with it that says 
that anything that is an E must also be an F, then a is an instance of F), 
and contradiction detection (e.g., I cannot be an instance of something that 
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has at least 3 children and at most 2 children). In the initial development 
version, explanation was only provided for these inferences in an effort to 
minimize development costs, resulting in a quite useful explanation mecha- 
nism with much less effort than a full explanation system. (The two current 
implementations of explanation in Classic contain complete explanation.) 
One demonstration system [McGuinness et al, 1995] incorporates special 
handling for the most heavily used inferences providing natural language 
templates for presentations of explanations aimed at lay people. 


7.4-2 Error handling 

Since one common usage of deductive systems is for contradiction detection, 
handling error reporting and explanation is critical to usability. This usage 
is common in applications where object descriptions can easily become over- 
constrained. For example, in the home theater system application, one could 
generate a non-contradictory request for a high quality stereo system that 
costs under a certain amount. The description could later become inconsis- 
tent as more information is added. For example, a required high quality, ex- 
pensive speaker set could violate a low total price constraint. Understanding 
evolving contradictions such as this challenges many users and leads them 
to request special error explanation support. Informal studies with internal 
users and external academic users indicate that adequate error support is 
crucial to the usability of the system. 

Error handling could be viewed simply as a special case of inference where 
the conclusion is that some object is found to be described by a special con- 
cept typically called bottom or nothing. For example, a concept is incoherent 
if it has conflicting bounds on some role: 


Bounds Conflict 


hCC(^mr) hCC(^nr) n<m 


If an explanation system is already implemented to explain proof-theoretic 
inference rules, then explaining error conditions is almost a special case of 
explaining any inference. There are two issues that are worth noting, how- 
ever. The first is that information added to one object in the knowledge base 
may cause another object to become inconsistent. In fact, information about 
one object may impact another series of objects before a contradiction is dis- 
covered at some distant point along an inference chain. Typical DL systems 
require consistent knowledge bases; thus whenever they discover a contra- 
diction, they use some form of truth maintenance to revert to a consistent 
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state of knowledge, removing conclusions that depend on the information 
removed from the knowledge base. Thus, it is possible, if not typical, for an 
error condition to depend upon some conclusion that was later removed. A 
simple minded explanation based solely on information that is currently in 
the knowledge base would not be able to refer to these removed conclusions. 
Thus, any explanation system capable of explaining errors will need access 
to the inconsistent state of the knowledge base as well as to its current 
state. 

Because of the added complexity resulting from the distinction between 
the current (consistent) state and the inconsistent state of the knowledge 
base and because of the importance of error explanation, we believe sys- 
tem designers will want to support special handling of error conditions. For 
example, in a number of situations surveyed, users typically asked for ex- 
planations of a particular object property or relationships between objects. 
Under error conditions, users had more trouble identifying an appropriate 
query to ask. This suggests that special error support should be introduced. 
In Classic, for example, an automatic error explanation option is gener- 
ated upon contradiction detection. This way the user requires no knowl- 
edge (other than the error explanation command name) in order to ask 
for help. 

Another issue of importance to error handling is the completeness or in- 
completeness of the system. If a system is incomplete then it may miss 
deductions. Thus, it is possible that an object is inconsistent - if all of the 
logically implied deductions were to be made - but, because the system 
is incomplete, it misses some of these deductions and thus the object re- 
mains consistent in the knowledge base. In order for users to be able to 
use a system that is incomplete, they may need to be able to explain not 
only error deductions but deductions that were missed because of incom- 
plete reasoning. An approach that completes the reasoning with respect to 
a particular aspect of an object is described in [McGuinness, 1996, Chap- 
ter 5] . Given the completed information, the system can then explain missed 
deductions. 


7.4.3 Pruning 

If a knowledge representation system makes it easy to generate and reason 
with complicated objects, users may find naive object presentations to be 
much too complex to handle. In order to make a system more usable, there 
needs to be some way of limiting the amount of information presented about 
complicated objects. For example, in the stereo demonstration application, 
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a typical stereo system description may generate four pages of printout. The 
information contained in the description may be clearly meaningful infor- 
mation such as price ranges and model numbers for components but it may 
also contain descriptions of where the component might be displayed in the 
rack and which superconcepts are related to the object. In certain contexts 
it is desirable to print just model numbers and prices, and in other contexts 
it is desirable to print price ranges of components. We believe it is critical to 
provide support for encoding domain independent and domain dependent in- 
formation which can be used along with contextual information to determine 
what information to print or explain. As one example, we consider some of 
the knowledge bases written for the DARPA High Performance Knowledge 
Base project. This project includes a very general upper level ontology with 
many slots defined on many of the classes. Most objects in the system inherit 
a large number of slots from upper ontology classes and it is not uncommon 
for normalized objects to have hundreds of slots associated with them even 
though they only have a couple of properties defined on them in the local 
knowledge bases. 

Knowledge representation systems faced with information overload need 
to take some approach to filtering. One of the simplest approaches allows 
a specification on roles concerning whether they should be displayed on 
objects or not. This may work for homogeneous knowledge bases where role 
information is uniformly interesting or uninteresting. Our experience is, 
however, that context needs to be taken into account in more heterogeneous 
knowledge base applications. One example implementation that allows con- 
text and domain dependent information to be considered along with domain 
independent information is implemented in Classic. A meta-language is 
defined for describing what is interesting to either print or explain on a 
class by class basis. Any subclass or instance of the class will then inherit 
the meta-description and thus will inherit “interestingness” properties from 
its parent classes. The meta-language essentially captures the expressive 
power of the base Description Logic with some carefully chosen epistemic 
operators to allow contextual information (such as known fillers or closed 
roles) to impact decisions on what to print. 

The meta-language has been used to reduce object presentation and ex- 
planation by an order of magnitude in at least one application [McGuinness 
et ai, 1995]. This reduction was required for the application to be able 
to include object presentation. The algorithms of the basic approach are 
included in [McGuinness, 1996]; the theory of a generalized approach is pre- 
sented in [Borgida and McGuinness, 1996] and further analyzed in [Baader 
et al, 1999a]. 
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7-4-4 Knowledge acquisition 

If an application is expected to have a long life-cycle, then acquisition and 
maintenance of knowledge become major issues for usability. There are two 
kinds of knowledge acquisition which are worth considering: (i) acquisition of 
additional knowledge once a knowledge base is in place, and (ii) acquisition 
of original domain knowledge. A complete environment will address both 
concerns; however, the original acquisition of knowledge is a much more 
general and difficult problem and, conveniently enough, is not the activity 
that many users will find themselves doing repeatedly while maintaining a 
project. 

We observe that with knowledge of the domain and appropriate analysis 
of evolution, it is possible to build a knowledge evolution environment suit- 
able for non-experts to use for extending knowledge bases. One such project 
considered the evolution support environment for configurators. The spe- 
cific domain and usage patterns were analyzed, and it was found that only 
certain classes had new subclasses added to them as product knowledge 
evolved. It was also found that instances were typically populated in par- 
ticular patterns. A special purpose interface was developed for a family of 
configurators that exploited these findings and supported new configurator 
application development by non-experts [McGuinness and Wright, 1998b]. 
Also, in related work, Gil and Melz [1996] have analyzed planning-based 
uses of another DL-based system that systematically supports knowledge 
base evolution with respect to the known plan usage. 

A more general problem that does not rely on domain or reasoning knowl- 
edge has been addressed in the editor work [Paley et al . , 1997] for the gen- 
eral frame protocol and also in editor work for collaborative generation and 
maintenance of ontologies by non-experts in the Collaborative Topic Builder 
component of FindUR [McGuinness, 1998] and recently in Chimaera work 
[McGuinness et al, 2000b] for merging, analyzing, and maintaining ontolo- 
gies. The general work, of course, is broader yet shallower with respect to 
reasoning implications. In the FindUR collaborative topic builder environ- 
ment, simple hierarchies of node names (with role filler and value restriction 
information) are used to support query expansion to provide more intelli- 
gent web searching. In order to deploy this broadly, a web-based distributed 
ontology editor was required to allow non-experts to input, modify, and 
maintain background ontologies. The basic functionality for this interface 
follows the same requirements specified in Section 7.2 although this par- 
ticular implementation limited some of the interface specifications accord- 
ing to expected usage patterns. For example, in the medical deployments 
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[McGuinness, 1999] of FindUR, it was expected that all of the roles that 
were to be used had been defined and thus pulldown lists of these roles were 
hardcoded into the interface and new role specification was not one of the 
exposed functionalities in the GUI. FindUR also allows importing of seed 
ontologies and supports contradiction detection from ontology input. Chi- 
maera’s environment takes the analysis task to a much more detailed level 
and it provides a number of different ways of detecting not only explicit 
contradictions but also possible contradictions and possible term merges. 


7.5 Other technical concerns 

The computer science concerns that affect the suitability of a knowledge 
representation system have to do with the behavior of the system as a com- 
puter program or routine, ignoring its status as a representer of knowledge. 
The most-studied aspect of this collection of concerns has to do with the 
computational analysis of the basic algorithms embodied in the system, in 
particular their worst-case complexity. Because this worst-case complexity 
has been so well studied, we will not say anything about it further, except 
to state that it is important in determining the suitability of a knowledge 
representation system for particular task, notably tasks that need a perfor- 
mance guarantee. 


7.6 Public relations concerns 

Researchers sometimes underestimate the varied public relations aspects 
involved with making a system usable. Barriers to usability come in many 
forms: potential users who are unaware of a system’s existence will not use it; 
potential users who do not understand how a system can meet the their needs 
are unlikely to use it; potential users who do not have enough understanding 
to visualize an abstract solution to their problem using a new system are 
unlikely to depend on the new system over tools they understand and can 
predict; and finally potential users who have a limited set of approved tools 
which does not include the new system are unlikely go to the effort of getting 
the new system approved for their internal use. In order to address these 
issues, DL system designers need to devise ways to make their systems known 
to likely users, educate those users about the possible uses, provide support 
for teaching users how to use them for some standard and leverageable uses, 
and either obtain approval for their systems or provide ammunition for users 
to gain approval. 
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In experiences with Classic, the following tools have been employed to 
overcome the above stated barriers to usability. 

Beyond the standard research papers, users demand usage guidelines 
aimed at non-PhD researchers. A paper that provides a running (executable) 
example on how to use the system is most desirable: an example is [Brach- 
man et al., 1991]. That paper also tries to provide guidance on when a 
DL-based system might be useful, what its limitations are, and how one 
might go about using one in a simple application. That paper was used as 
the basis of a tutorial on building ontologies in other knowledge representa- 
tion systems including Protege and Ontolingua [Noy and McGuinness, 
2000 ]. 

A demonstration system is also of great utility as it helps users understand 
a simple reasoning paradigm and provides a prototyping domain for showing 
off novel functionality which exploits the strengths of the underlying system. 
In the Classic project a number of demonstration systems were developed, 
including a simple application that captures “typical” reasoning patterns 
in an accessible domain. This one system has been used in dozens of uni- 
versities as a pedagogical tool and test system. While this application was 
appropriate for many students, an application more closely resembling some 
actual applications was needed to (i) give more meaningful demonstrations 
internally and to (ii) provide concrete suggestions of new functionality that 
developers might consider using in their applications. This led to a more 
complex application with a fairly serious graphical interface [McGuinness 
et al . , 1995]. Both of these applications have been adapted for the web. 3 It 
was only when a demonstration system that was clearly isomorphic to the 
developer’s applications was available that there could be effective providing 
of clear descriptions and implemented examples of the functionality that we 
believed should be incorporated into development applications. 

Interactive courses are also of benefit in training potential users in how to 
use a DL-based knowledge representation system. Several courses [McGuin- 
ness et al, 1994; Abrahams et al., 1996] on how to use Classic have been 
developed, including one from a university for course use, which includes 
a set of five running assignments to help students gain experience using 
the system. Other general DL courses can be found on the DL web site at 
http : //www . dl . kr . org/. 

For a system to be used in the business community, it has to satisfy their 
demand for common standard implementation languages, reasonable sup- 
port, and standard platform toolkits. Some DL implementations, such as 

3 The web version of the wines demonstration system was provided by Chris Welty and is avail- 

able at http://untangle.cs.vassar.edu/wine-demo/index.htinl. 
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Classic, attempted to meet this need by providing an implementation in C 
while still maintaining the Lisp research version. This later proved problem- 
atic to maintain and the decision was made to provide an implementation 
in C++ that was to meet both developers’ and implementers’ needs. Inter- 
estingly enough, years later, it is the Lisp version that appears to be most 
heavily used. More details of the evolution of the usability of that system 
can be found in [Brachman et al., 1999]. 


7.7 Summary 

Although a knowledge representation system must have sufficient expressive 
power and appropriate computational complexity to be considered for use in 
applications, there are many other issues that also determine whether it will 
be used. These issues involve access to the knowledge stored in the system, 
such as explanation and presentation of the knowledge, other technical is- 
sues, such as efficiency and programming interfaces, and non-technical issues, 
such as publicity and demos. If these issues are not addressed appropriately, 
a knowledge representation system will not be used in real applications. 
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Abstract 

This chapter discusses implemented DL systems that have played or play 
an important role in the field. It first presents several earlier systems that, 
although not based on Description Logics, have provided important ideas. 
These systems include Kl-One , Krypton, Nikl, and Ivandor. Then, 
successor systems are described by classifying them along the characteris- 
tics discussed in the previous chapters, addressing the following systems: 
Classic (“almost” complete, fast); Back, Loom (expressive, incomplete); 
Kris, Crack (expressive, complete). Finally, a new optimized generation 
of very expressive but sound and complete DL systems is also introduced. 
In particular, we focus on the systems Dlp, FaCT, and Racer and explain 
what they can and cannot do. 


8.1 New light through old windows? 

In this chapter a description of the goals behind the development of differ- 
ent DL systems is given from a historical perspective. The description of 
DL systems allows important insights into the development of the knowl- 
edge representation research field as a whole. The design decisions behind 
the well-known systems which we discuss in this chapter not only reflect 
the trends in different knowledge representation research areas but also 
characterize the point of view on knowledge representation that different 
researchers advocate. The chapter discusses general capabilities of the sys- 
tems and gives an analysis of the main language features and design decisions 
behind system architectures. The analysis of current systems in the light of 
a historical perspective might lead to new ideas for the development of even 
more powerful DL systems in the future. References to previous descriptions 
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of DL systems (e.g., in [MacGregor, 1991a; Woods and Schmolze, 1992; 
Horrocks, 1997a]) or publications on DL theory that also contain dis- 
cussions about DL systems (e.g., [Patel-Schneider, 1987a; Nebel, 1990a; 
Schmidt, 1991]) are included where appropriate. For references to other sys- 
tems not mentioned here see also [Woods and Schmolze, 1992] and [Nebel, 
1990b, pp. 46f., pp. 63f.]. 

Basic concept and role constructors have already been introduced in Chap- 
ter 2 (see also the Appendix for a summary of syntax and semantics of DL 
constructors). However, before starting the discussion about DL systems it 
is appropriate to introduce some notation for language constructors in order 
to keep this chapter self-contained. It is assumed that the reader is famil- 
iar with the basic Description Logics AC and ACC. In a similar way as in 
Chapter 2, further language features are indicated by different letters. The 
letter J\f is used for simple number restrictions and the letter Q is used for 
qualified number restrictions. XL is used for role hierarchies with multiple 
parents whereas h is used for role hierarchies with single inheritance only. 
In some languages, no role hierarchies but role conjunctions are provided. 
Role conjunctions are indicated with the letter 1Z in the following. In ad- 
dition, the abbreviations T and / are used for features with and without 
equality for feature chains (i.e., agreements), respectively. The index R + is 
used to indicate support for transitive roles. Language constructors for an 
extensional specification of concepts using nominals (or individuals) are de- 
noted by the letters O and B (see Chapter 2 or the Appendix for details). 
If inverse roles are supported by a DL system, this is indicated either by a 
superscript or by the letter X. The latter variant is used in order to allow 
a convenient pronunciation of the DL language. 

8.2 The first generation 

Inspired by research on human cognitive behavior, proposals for knowl- 
edge representation languages were first discussed in the late 1960s. For 
example, [Quillian, 1967] is one of the first publications of the languages 
called “semantic networks” (see also [Quillian, 1968]). Originally, seman- 
tic network formalisms were seen as alternatives to first-order logic. In 
a similar spirit, [Minsky, 1981] introduced the initial notion of a frame 
system. The motivation of these representation formalisms was to mimic 
human reasoning in the sense of achieving “cognitive adequacy”. Thus, 
the idea was to support problem solving with appropriate representation 
structures that somehow “resemble” representation structures assumed in 
human information processing. The exploitation of inheritance was a 
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predominant idea in frame systems. The specification of knowledge bases 
should be simple and the use of the representation structures should 
be intuitive (“epistemological adequacy”). However, as pointed out by 
[Woods, 1975], it was not at all simple to specify what an inference sys- 
tem was supposed to actually compute. The late 1970s saw initial re- 
search on the relation of frame systems and first-order logic [Hayes, 1977; 
1979] which revealed that some aspects of frame-based systems can be con- 
sidered as special “instantiations” of first-order reasoning. Hayes argued that 
frame-based reasoning was not an entirely new way of knowledge representa- 
tion with particular advantages over first-order reasoning. Specific features 
of frame systems beyond first-order reasoning (e.g., defaults) were not very 
well understood at that time. The consequence of these publications was that 
many researchers no longer considered frame systems and semantic network 
systems as possible alternatives to logic-based approaches. 

The criticisms of early frame systems and semantic network formalisms 
stimulated research on the development of mathematical structures and 
techniques for defining the semantics of representational constructs sup- 
ported by different representation languages. For instance, in early frame 
systems there was no clear distinction between constructs for representing 
“generic” knowledge about sets of individuals and knowledge about “spe- 
cific” individuals. Furthermore, frames were often used as data structures 
in procedural programs. For these programs a formal specification of what 
they were expected to compute was rarely provided. Rather than interpret- 
ing frame structures as data structures, [Woods, 1975] suggested using a 
formal semantics to clearly specify what is to be computed by inference 
algorithms. 


Kl-One 

Inspired by critics such as [Woods, 1975], Brachman started to develop a new 
representation system (called Kl-One) that inherently included the notion 
of inferring implicit knowledge from given declarations [Brachman, 1977b; 
1979]. Although the initial approach was not logic-based, Kl-One started 
the era of logic-based representation systems which can be used to formalize 
application problems as inference problems over the constructs supported 
by the representation language. One of the prevailing inference patterns is 
centered around inheritance [Brachman, 1983]. The final report on the Kl- 
One language is published in [Brachman and Schnrolze, 1985]. 

One of the core ideas behind Kl-One as a representation language for 
the “epistemological level” resulted from problems with languages offering 
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built-in primitives for general representation purposes (e.g., CD theory 
[Schank, 1975]). Rather than providing general built-in primitives, in Kl- 
One, for a specific representation problem a set of adequate primitives was 
defined by the user. The primitives were denoted by so-called concept names. 
The next idea was to use concept-forming operators to build new concepts 
from basic concepts. These compound concepts were also referred to as “con- 
cepts”, “concept terms” or “concept descriptions”. Generic concepts were 
intended to denote classes of individuals and individual concepts were in- 
tended to denote individuals (see also [Nebel, 1990a, p. 42]). Individuals 
were related by so-called roles which, in turn, could be primitive roles (role 
names) or roles described with role constructors [Brachman and Schmolze, 
1985]. 

In Kl-One, concepts and roles are the building blocks for representa- 
tional purposes. The main idea behind concepts and concept constructors 
in Kl-One is that the meaning of a concept is derived only from the mean- 
ing of its superconcepts and other restrictions associated with a concept 
[Brachman and Schmolze, 1985]. A Kl-One generic concept consists of a 
set of superconcept names, a set of role descriptions, and a set of struc- 
tural descriptions [Patel-Schneider, 1987a, pp. 58f.]. 1 Roles can be viewed 
as potential relationships between an individual of a certain class and other 
individuals in the world [Nebel, 1990a, p. 42]. 

Role descriptions could be either restrictions or differentiations. The for- 
mer restricted the class of permitted fillers (value restrictions) or the number 
of fillers (number restrictions). Role differentiations were used to describe a 
subrole with possible value or number restrictions. So-called structural de- 
scriptions were used to state relationships between the fillers of roles (see 
also [Patel-Schneider, 1987a, pp. 58f.]). Descriptions for individual concepts 
consisted simply of a set of values for roles plus a set of generic concepts. 
Individual concepts were seen as instances of these generic concepts, i.e. , 
an individual concept had to satisfy all restrictions (and differentiations) 
inherited by the generic concepts. On the other hand, individual concepts 
were also subsumed by their generic concepts. However, the semantics of 
individuals was never completely worked out (see [Schmolze and Brachman, 
1982, pp. 23-31] cited after [Nebel, 1990a, p. 64]). 

The representation structures offered by Kl-One were similar to those 
offered by semantic networks or frames. Although, initially, the structures 
offered by Kl-One were called “structural inheritance networks” [Brach- 

1 Note that, in Kl-One -like languages, there are specific syntactic constructs for specifying 
superconcepts. These specific constructs are no longer present in logic-based concept languages 
of the 1990s. 
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man, 1977b; 1979], in [Brachman and Levesque, 1984] the authors talk of 
“frame structures”. 2 In accordance with [Nebel, 1990a, p. 45] we argue that 
in contrast to, e.g., CD theory [Schank, 1975], providing a (large) set of 
primitive representation structures (names) for all kinds of representation 
purposes was not the development goal of Kl-One . As Nebel points out 
[Nebel, 1990a, p. 45] , more important and unique to Kl-One is the core idea 
of proving ways to specify concept definitions , i.e. , the ability to let a knowl- 
edge engineer declare the relation of “high-level concepts” to “lower-level 
primitives” . 

A concept definition was an assignment of a (unique) name to a concept 
term. In Kl-One the well-known distinction between the two kinds of con- 
cept definitions, definitions with necessary and sufficient conditions and def- 
initions with only necessary conditions (so-called primitive definitions), was 
investigated for knowledge representation purposes for the first time. 3 In the 
original approach no cycles were allowed in the set of concept definitions. 4 
The most important consequence of the introduction of concept definitions 
with necessary and sufficient conditions was that reasoning about the rela- 
tionships between concepts became important. In Kl-One there is still the 
notion of a “told subsumer” syntactically being explicitly mentioned in a list 
of so-called superconcepts but, according to the semantics, there are also ad- 
ditional computed subsumers which are concept names (direct subsumers or 
direct superconcepts). Note that inferences in Kl-One were based on the 
open-world assumption. Hence, unlike frame systems, where the names as 
superconcepts are always given explicitly, Kl-One introduced the idea that 
the set of direct superconcepts (i.e., concept names) for a given concept must 
be inferred. 

Direct superconcept-subconcept relationships (also called parent-children 
relationships) are dependent on the concept terms used in the definitions of 
a TBox. In particular, the notion of defined concepts (with necessary and 
sufficient conditions) led to the idea of classifying a TBox. The idea was 

2 There are large differences between frame systems and DL systems: if for i the restriction \fR.C 
holds, and we set i into relation to j via the role R, then every KL-ONE-based system concludes 
that j is an instance of C. In standard frame-based systems, j can only be set into relation to 
i via R if it is already known that j is an instance of C. Otherwise, in frame systems at least 
a warning is issued or even an error is signaled. 

3 In the literature, some authors use the word “definition” as a synonym for concept terms 
themselves (e.g., [Schmidt, 1991], see also [Woods, 1991, p. 65]). In this case, “primitive” 
concepts with only necessary conditions were introduced with a specific marker to be used in 
concept terms. 

4 The semantics of cycles was analyzed in [Baader, 1990b; 1991; Nebel, 1990a; 1991]. The so- 
called descriptive semantics provided many advantages over so-called fixpoint semantics. For 
details see [Nebel, 1990a]. One of the first publications of an expressive Description Logic 
supporting cyclic axioms with a descriptive semantics and a sound and complete calculus is 
[Buchheit et a/., 1993a]. Cyclic axioms are usually not considered as concept definitions. 



Description Logic Systems 


309 


to compute the subsumption hierarchy (sometimes also called “inheritance 
hierarchy”) of parents and children for each concept name mentioned in 
a TBox during a so-called classification process. The intention was that a 
model for a specific application domain could be verified by a knowledge 
engineer based on the subsumption hierarchy. Considering the subsumption 
hierarchy, i.e. , the lattice of direct superconcepts, the idea was also that 
concept terms could be automatically “inserted” between named concepts 
in the hierarchy. Hence, concept terms could be set into relation to “pre- 
defined” concept names (and, indirectly, other concept terms). This feature 
has been used in many projects for implementing application functionality. 

The first development of an algorithm for computing the subsumption 
hierarchy of a TBox (the “classifier”) is described in [Schmolze and Lipkis, 
1983]. Another inference component called “realizer” computes for each indi- 
vidual mentioned in an ABox the most specific atomic concepts (or concept 
names) of which the individual is an instance. One of the first algorithms 
for computing the realization of an ABox is described in [Mark, 1982], Ini- 
tial Kl-One systems were implemented in Interlisp [Lipkis, 1982] and 
Smalltalk [Fikes, 1982]. The Consul project [Kaczmarek et al, 1986] was 
one of the first projects in which classifier and realizer inference services 
were exploited. 

First investigations about defaults and exceptions were published in 
[Brachnran, 1985]. Nowadays, the semantical theory of defaults in Descrip- 
tion Logics is much clearer, see [Baader and Hollunder, 1992; 1993; Baader 
and Schlechta, 1993; Padgham and Zhang, 1993; Padgham and Nebel, 1993; 
Baader and Hollunder, 1995a; 1995b; Donini et al, 1997b]. 

At the first Kl-One workshop [Schmolze and Brachnran, 1982] it became 
clear that the informal specification of the semantics of Kl-One concept and 
role constructors led to serious problems. The development of the classifier 
[Schmolze and Lipkis, 1983] was based on the intuitive meaning of the Kl- 
One formalism [Nebel, 1990a, p. 46]. Attempts to logically reconstruct the 
representation constructs, e.g., [Schmolze and Israel, 1983; Israel and Brach- 
man, 1984], resulted in a deeper understanding of the formalism. Given the 
formal semantics, implemented algorithms for classification and realization 
were shown to be incomplete. Later investigations revealed that Kl-One 
(with the formal semantics given in the logical reconstruction approaches) 
is undecidable (e.g., this holds for the combination of conjunction, value re- 
strictions and role- value-maps [Schmidt-Schaufi, 1989]). In [Brachnran and 
Levesque, 1984] the first thoughts about tractability of subsumption for sub- 
languages are discussed. Terminological reasoning with concept definitions 
even for sublanguages with low expressiveness were shown to be inherently 
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intractable in the worst case [Nebel, 1990b, p. 28, pp. 71f.]. Proposals for a 
semantics based on many- valued logics (e.g., [Patel-Schneider, 1986; 1987a; 
1987b; 1989a]) ensure tractable algorithms concerning concept consistency 
reasoning but also result in a weak expressiveness: many intuitive inferences 
are not sanctioned by this semantics (see also [Nebel, 1990a]). 

Another result of [Schmolze and Brachman, 1982] was that the semantics 
of individual concepts was not quite clear (e.g., concerning coreference and 
unique name assumption). Thus, at the first Kl-One workshop [Schmolze 
and Brachman, 1982], the notions of a hybrid reasoning system consisting 
of a TBox (a set of concept definitions) and an ABox (a set of assertions 
concerning individuals) were made more precise. The change of the view on 
Kl-One spelled out in [Schmolze and Brachman, 1982, pp. 8-17] (see also 
[Nebel, 1990a, p. 46]) can be summarized as follows: It is not the names of 
representation structures that are important but the functionality, i.e. , the 
declaration and inference services which the system provides. It was first 
pointed out that inferences have to be formally defined based on the seman- 
tics of the representation formalism. This view led to the development of 
the functional view of knowledge representation as pursued with the devel- 
opment of the system Krypton. 

Krypton 

The knowledge representation system Krypton [Brachman et al., 1983a; 
1983b; 1985] can be seen as the first attempt at defining a new language of 
the Kl-One family with a formal, Tarskian semantics. Furthermore, the goal 
was to overcome the problems with individual concepts in Kl-One [Nebel, 
1990a, p. 63]. The hybrid representation approach with a TBox and an 
ABox was first implemented in the Krypton system (see also [MacGregor, 
1991a, p. 39l]). As in Kl-One, the distinction between primitive and defined 
concepts and the computation of the most specific atomic concepts which 
instantiate individuals is one of the core ideas of Krypton. 

Krypton offered a concept language with low expressiveness. While the 
initial approach [Brachman et al, 1983b] was too expressive to be tractable 
(see also [MacGregor, 1991a, p. 390]), in a revised version [Brachman et al, 
1985] the concept constructors of Krypton were defined as conjunction, 
value restrictions and role chains. Thus, subsumption checking was polyno- 
mial [Patel-Schneider, 1987a, p. 75]. For the ABox a full-fledged resolution- 
based FOPL theorem prover [Stickel, 1982] was proposed, i.e., the ABox 
reasoner of Krypton was incomplete. Another perspective is that Kryp- 
ton started with a first-order logic theorem prover and augmented it with 
a special-purpose inference system for terminological reasoning to cut out 
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some of the combinatorial search [Vilain, 1985]. Krypton can be regarded as 
one of the first efforts in combining knowledge representation and theorem- 
proving techniques but was not used for industrial applications [Nebel, 
1990a, pp. 63f.]. 

Rather than dealing with specific representation structures and operations 
on them. Krypton offers a so-called “functional approach” . Using the inter- 
face functions “tell” and “ask” , a knowledge base can be defined and queries 
can be answered about it. In this sense, a “functional approach” means that 
a formal representation system does not necessarily have to maintain, for 
instance, frame structures, the subsumption hierarchy, or even an ABox as a 
graph structure. If, for internal implementation purposes, graph structures 
are indeed used, they are nevertheless hidden from the user in order to avoid 
“procedural” operations being carried out with internal record structures. 
Arbitrary procedural operations are usually not related to the semantics of 
the representation formalism, so that, in this case, it is hard to characterize 
what is actually represented and what is computed as solutions to inference 
problems. Thus, the focus of Krypton was not on the structures to be 
maintained by the system but was centered around the question about what 
the system should do for the user, i.e. , what services should be made avail- 
able. In other publications this idea was described as the “knowledge level” 
[Newell, 1982]. In Krypton, inference services for concept terms are checks 
for concept consistency, disjointness, and subsumption. For a TBox, the most 
specific subsumers (parent-children relation) can be computed, whereas for 
an ABox, consistency, instance checking, realization (direct types) and in- 
stance retrieval are offered as inference services. Krypton pioneered the 
idea that the user should only know, at some level not dependent on imple- 
mentation details, what questions the system is capable of answering and 
what operations are permitted that allow new information to be provided 
to it. For instance, it is not important how the association between an in- 
dividual and a certain role filler is actually represented in terms of memory 
arrangements (called the symbol level). What counted for the underlying 
implementation was what operations must be supported in order to answer 
queries at the semantical level. This view about KL-ONE-based representa- 
tion systems was one of the major achievements of the Krypton project. 

Nikl, Penni, Kl-Two 

At the same time as Krypton, the knowledge representation system 
Nikl was developed as a successor of Kl-One. Nikl was a New Im- 
plementation of Kl-One [Schmolze and Israel, 1983; Schmolze, 1985; 
Schmolze and Mark, 1991]. As discussed in [Kaczmarek et al., 1986], in 
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Nikl, roles are also ordered with respect to subsumption (see also [Schmidt, 
1991, p. 13]). 

The assertional components of Kl-One were initially discarded in the 
Nikl system (see the Nikl user guide [Robins, 1986]). Compared to the 
initial Kl-One implementation, the algorithms in the Nikl classifier were 
faster in the average case because “obvious” information was exploited to a 
larger degree (see [MacGregor, 1988, p. 405] or [MacGregor, 1991a, p. 392]). 
However, the subsumption algorithm of Nikl was incomplete and it was 
hard to characterize which inferences were omitted [Schmolze and Israel, 
1983] (see also [Patel-Schneider, 1987a, p. 74]). 

Later, an assertional reasoning component was added with the system 
Penni which is based on RUP [McAllester, 1982]. The resulting system was 
called Kl-Two [Vilain, 1985] (see also [Schmidt, 1991, p. 15]). In Kl-Two a 
propositional reasoner with equality (the Penni subsystem) was augmented 
with a so-called quantificational reasoning component (the Nikl subsystem). 
For the propositional part in the Penni component, incremental additions 
and retractions were supported due to the facilities provided by RUP. 
However, as shown in [Patel-Schneider, 1989b] the concept language of Nikl 
contained concept and role constructs that rendered the satisfiability prob- 
lem for Nikl concept terms undecidable (see also [Schmidt-Schaufi, 1989]). 

Concerning hybrid reasoning, i.e., the systematic integration of TBox 
and ABox reasoning, there were shortcomings as well. Because in RUP 
different constants do not necessarily denote different objects, the unique 
name assumption was not built into the assertional component Penni. 
Thus, number restrictions imposed by Nikl concepts often did not have the 
intended effects concerning hybrid reasoning. Other sources of incomplete- 
ness were pointed out (see also the analysis of “inferential gaps” in [Nebel, 
1990a, pp. 63f.]). The research on the Kl-Two system demonstrated that 
hybrid reasoning is not just a matter of integrating reasoning subsystems 
at the software level. Hybrid reasoning requires a dedicated architecture 
implementing a sound and complete calculus which, in turn, can be 
developed only after a deep analysis of the semantics of the representation 
constructs. Nevertheless, the principal idea of exploiting subsumption 
information for resolution-based first-order reasoning has been integrated 
into many theorem-proving systems. 


Kandor 

Research on Kandor [Patel-Schneider, 1984] was influenced by the Kryp- 
ton architecture and the performance problems of the Nikl approach. The 
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goal of Kandor was to increase the expressive power of the terminological 
representation component in such a way that an efficient subsumption algo- 
rithm could be developed. Basically, Kandor supported conjunction, value 
restriction and number restrictions as concept- forming operators. In mini- 
mum number restrictions, range-restricted roles could be used (hence, qual- 
ified minimum number restrictions were allowed, see also [Patel-Schneider, 
1987a, p. 76]). In order to provide effective inference algorithms (e.g., for 
information retrieval scenarios) in the Kandor approach the expressive- 
ness of the assertional component was cut down to a representation system 
comparable to a database (without revision mechanisms). Subsumption in 
Kandor was shown to be coNP-complete (see [Nebel, 1988] and [Nebel, 
1990a, p. 90] for details). The initially proposed subsumption algorithm 
with polynomial runtime must have been incomplete. 

Kandor was called a frame-based system (which might be reasonable 
because of the expressiveness offered by the ABox language). A frame in 
Kandor was essentially a specification of conditions for describing how an 
individual can be an instance of it (in terms of superframes and restrictions). 
Kandor supported defined frames and primitive frames in the spirit of Kl- 
One. The system adopted the “small interfaces” approach of Krypton, 
i.e. , models were built using the declaration interface (tell interface), and 
application services were realized with the query interface (ask interface). 
Although called a frame system, frames were not treated as record structures 
to be manipulated by procedural programs. The authors of Kandor argued 
for a small knowledge representation system that could be used as part 
of larger systems with different subcomponents. The main achievement of 
Kandor was the introduction of a small-can-be-beautiful approach which, 
finally, led to the design of the system Classic which will be discussed in 
detail in the next section. 


8.3 Second generation Description Logic systems 

Whereas the prototypical implementations of first generation systems were 
used to study knowledge representation problems, second generation DL 
systems have been more extensively used in serious applications. The im- 
plementations discussed in this section are not only not prototypes but are 
much more stable. In addition, since the beginning of the 1990s, the systems 
have been called DL systems. We first discuss systems for (almost) tractable 
languages based on (almost) complete algorithms and investigate systems 
for expressive Description Logics afterwards. 
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Classic 

The basic Classic system supported the logic ACM 3Fh~ l with TBoxes and 
ABoxes plus facilities for dealing with numbers [Borgida et al, 1989]. We use 
the lowercase letter h to indicate that Classic supports role inclusion but 
not role conjunction, i.e., Classic supports “single-inheritance” role hierar- 
chies. Classic is available for research purposes. Implementation languages 
for Classic are CommonLisp [Steele, 1990] and C. The interfaces are de- 
scribed in [Resnick et al . , 1995]. Full Classic also contained the concept 
constructors O and B for referring to individuals in concept terms. 

Subsumption in full Classic was initially assumed to be polynomial 
[Borgida et al., 1989]. Problems with individuals in full Classic were rec- 
ognized in [Patel-Schneider et al., 1991]. At the same time, subsumption in 
Classic was shown to be conp complete [Lenzerini and Schaerf, 1991]. In 
the modified semantics for the concept constructors O and B (see [Borgida 
and Patel-Schneider, 1994]) the interpretation function maps individuals in 
concept terms to disjoint sets of domain objects. With this semantics con- 
cerning individuals the inference algorithms of the Classic system could be 
shown to be complete [Borgida and Patel-Schneider, 1994]. However, given 
the non-standard semantics for the concept constructors O and B, the same 
effect can be achieved with existential quantifications and disjunctions w.r.t. 
atomic concepts: 5 For each individual I a new atomic concept Aj can be in- 
troduced. Note that atomic concepts are also mapped to sets of individuals. 
Additionally, since Classic imposes the unique name assumption, a set of 
axioms ensures that the new atomic concepts are disjoint. Now every term 
of the form 3R.I can be replaced by 3R.Aj. Terms of the form {/i, . . . , /„} 
can be replaced by Aj 1 LI • • • U Aj n . In an ABox, for each individual / a con- 
cept assertion is added to ensure that the individual is an instance of the 
associated atomic concept Aj. Thus, only in an ABox, can a real coreference 
between roles be enforced. On the one hand, we can call the Classic system 
“almost” complete. “Almost” refers to non-standard semantics w.r.t. indi- 
viduals being supported by current system implementations. On the other 
hand, the transformation makes clear that in Classic nevertheless a limited 
kind of disjunction (with concept names for which no definitions exist) can 
be expressed while retaining polynomial inference algorithms. 

The recommended techniques for knowledge-based system development 
with Classic are outlined in [Brachman et al., 1991]. As Brachman [Brach- 
man, 1992, p. 256] points out, a tractable Description Logic does not guar- 
antee that a system is useful in practice. Therefore, the Classic system 

5 Note that these concept constructors are not directly provided by CLASSIC. 
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was also carefully designed to meet practical requirements and to guarantee 
predictable system behavior. The context in which the system was expected 
to be used required that many queries were given to knowledge bases which 
rarely changed. The architectural design of Classic supports a precompu- 
tation of index structures such that queries can be answered quickly (mostly 
by simple storage retrieval). The architecture is made possible by a careful 
selection of the concept and role constructors for the DL language. Inference 
services for the Description Logic supported by Classic can be implemented 
by transforming concept expressions into a normal form (“structural sub- 
sumption”). Once the normal form is computed, queries can be answered 
by inspecting the data structures used to encode the normal form. It should 
be noted that, in Classic, retraction of told information is possible but not 
optimized. 

Another facility offered by Classic is a rule system. Rules are applied 
to individuals explicitly named in the ABox. Furthermore, rules are applied 
in a forward-chaining way. Basically, a rule has a precondition (a concept) 
and a conclusion (also a concept). If it can be shown that an individual 
mentioned in the ABox is an instance of the precondition concept, a concept 
assertion for stating the membership of the individual in the conclusion 
concept is added to the ABox. In order to provide support for modeling, the 
rule base is statically checked for inconsistencies. For instance, if there are 
two rules whose preconditions subsume each other, the conclusions must not 
be disjoint. 

Furthermore, Classic provides simple support for closed-world reasoning 
([Resnick et al, 1995], see also [Weida, 1996]). Closing a role for an individ- 
ual means adding an appropriate maximum number restriction for the role. 
The maximum number of fillers is restricted to the largest integer such that 
the minimum number restriction with this integer (and the corresponding 
role) is entailed by the knowledge base. The problem with role closing is that 
in combination with rules, the exact sequence of several closing operations 
determines what actually holds in the resulting ABox. These and other prob- 
lems concerning different closing operations have to be considered with de- 
fault reasoning as the theoretical background [Baader and Hollunder, 1995a; 
1995b; Donini et al, 1997b; Rosati, 1998]. For a specific approach concerning 
the integration of defaults into the Classic system see also [Wahlof, 1996; 
Lambrix et al, 1998]. 

Classic is one of the first systems that provided support for incorpo- 
rating inferences over other domains. Consistency and subsumption check- 
ing for expressions of another domain (e.g., the reals) can be integrated 
into the Classic system via an extension interface [Borgida et al, 1996]. 
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Classic was one of the first DL systems designed with respect to users who 
are not experts in DL theory. An important lesson learned by the Classic 
approach and its applications was the importance of explanation and out- 
put pruning facilities [McGuinness and Borgida, 1995; McGuinness, 1996; 
Borgida and McGuinness, 1996]. Moreover, Classic was the first system 
capable of supporting some reasonable form of error reporting [Brachman, 
1992], However, in the current state of the art there is hardly an adequate 
measure for the quality of these indispensable services [Brachman, 1992, 
p. 253]. 

Although Classic was a very successful DL modeling environment, the 
low expressiveness of the Classic Description Logic made it hard to use 
the system in many kinds of applications. In many cases, users wanted more 
expressiveness [Patel-Schneider et al, 1990]. In the following sections we dis- 
cuss systems for (more) expressive Description Logics. As can be expected, 
increases in expressiveness came at a certain price. The predictability of the 
behavior of Classic in terms of performance could not be reached by sys- 
tems implementing complete algorithms for more expressive DLs. On the 
other hand, incomplete algorithms have the problem that results computed 
by a system cannot be trusted in general. Thus, the complete-incomplete 
debate for expressive DL systems started at the end of the 1980s and the 
beginning of the 1990s. First, we describe the systems Loom and Back, 
which are based on incomplete algorithms. Afterwards, initial research on 
DL systems based on complete algorithms is summarized with a discussion 
of the systems Kris and Crack. 


Loom 

The Loom architecture [MacGregor and Bates, 1987; MacGregor, 1991b] 
offers TBox and ABox reasoning facilities for a Description Logic that can 
be characterized by the name ACCQRJAFO plus additional constructs for 
dealing with real numbers (see also [Brill, 1994] or [Horrocks, 1997a, p. 43]). 
Loom is based on Kl-One, i.e. , concept definitions with necessary or with 
necessary and sufficient conditions play an important role in domain mod- 
eling with Loom. It should be emphasized that truth maintenance facilities 
for revision were built into the Loom architecture right from the begin- 
ning and have influenced the design of the whole system [MacGregor, 1988; 
MacGregor and Brill, 1992]. While the first Loom versions were based on 
Description Logics [MacGregor and Brill, 1992] , in later versions an attempt 
was made to develop a “description classifier for the Predicate Calculus” 
[MacGregor, 1994]. For instance, facilities for dealing with definitions for 
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relations were added. The current version of Loom is implemented in Com- 
monLisp and is available for research purposes. A new system (called Pow- 
erLoom) for CommonLisp as well as C- and Java-based platforms can be 
licensed as well. 

A distinguishing design goal of Loom was the incorporation of an expres- 
sive query language for retrieving ABox individuals. Another design goal of 
Loom was to support rule-based programming [Yen et al, 1991b; 1991a; 
MacGregor and Burstein, 1991]. Based on the rule system, it is possible to 
specify additional necessary conditions for individuals which (i) are explicitly 
mentioned in the ABox and (ii) are derived to be instances of a certain de- 
fined concept. The additional necessary conditions are called “implications” 
in Loom [MacGregor, 1988]. The additional necessary conditions specified 
by rules are not exploited, for instance, for TBox reasoning. Note that an 
“implication” A — ► B stated by a Loom rule does not mean that -> B — > -> A 
holds, i.e. , rule-based “implications” are not to be confused with true logi- 
cal implications as provided by generalized concept inclusions that are now 
standard in newer systems (see below). 

In order to meet the performance requirements of the applications for 
which Loom was developed (e.g., natural language and image interpreta- 
tion), incomplete algorithms for concept consistency and subsumption are 
implemented. Concerning ABox reasoning, Loom applications required spe- 
cific strategies to avoid the computation of unused results. Rather than 
employing the usual forward-chaining strategy of computing the most spe- 
cific atomic concepts of which the ABox individuals are instances, Loom 
uses a scheme that considers the queries being posed to the system. Thus, 
backward-chaining strategies for query answering are used in the imple- 
mentation [MacGregor and Brill, 1992]. However, for the rule system, it is 
important to detect whether an individual is an instance of a concept that is 
used as a precondition of a rule. In this case, forward-chaining techniques are 
exploited [MacGregor, 1991b; MacGregor and Brill, 1992]. The combination 
of forward-chaining and backward-chaining inferences can be specified for 
a certain application problem by “marking” concepts accordingly. The user 
can control the inference process by these means but is also responsible for 
estimating the effects of these declarations. 

The arguments for the Loom approach can be summarized as follows: 
The intractability of the representation language can hardly be avoided if 
the requirements of users are to be fulfilled. Therefore, the idea is to support 
the features in one system rather than as a set of application-specific ad hoc 
supplements (“Where resides the scruffiness?” [MacGregor, 1991a, p. 396]). 
Obviously, incompleteness is no problem as long as the answers of the 



318 


R. Moller and V. Haarslev 


inference system are interpreted in the right way (i.e., “no” answers should 
not be trusted) . Several researchers argued that there is always the inherent 
danger that non-expert users either do not know this or might not recog- 
nize this as a potential danger (cf. the work on complete systems [Baader 
and Hollunder, 1991a; 1991b] discussed below). However, if a combinatorial 
explosion occurs in a complete algorithm, in practice, no result is available 
at all. Concerning incomplete algorithms for decidable Description Logics, 
arguments similar to those for other modeling environments based on first- 
order logic can be mentioned: If, in a certain application, concept terms are 
checked for consistency and combinatorial explosions occur in complete al- 
gorithms, incomplete algorithms at least might provide some support, e.g., 
for building a TBox. Just signaling a timeout during the execution of a com- 
plete algorithm that runs into a combinatorial explosion might result in less 
information. In this case, an incomplete algorithm might succeed in finding 
at least some inconsistencies. Note however, that in modern inference system 
technologies supporting complete reasoning, incomplete reasoners are used 
as “preprocessors” in order to speed up inferences (see the next chapter). 

Loom supports different kinds of individuals (classified instances, light 
instances, CLOS instances). For different kinds of instances different levels 
of inference services are supported, e.g., for classified instances, the set of 
most specific atomic concepts of which the classified individual is an instance 
is computed once new assertions are specified. Thus, for classified instances, 
the rule-based forward-chaining engine is triggered, possibly adding new 
assertions to an ABox (for details see [MacGregor and Brill, 1992]). 

A problem with the Loom approach is that from a user perspective it is 
hard to characterize the source of the incompleteness of the Loom reason- 
ing algorithms (see the discussion in [Horrocks, 1997a, p. 42]). Although the 
inference techniques used in Loom are characterized in [MacGregor, 1991b, 
p. 90], once a system is incomplete there is no adequate measure for the 
“quality of service” in terms of an implementation-independent characteri- 
zation. For instance, in Classic the characterization of the incompleteness 
of the inference system concerning individual reasoning was given in terms 
of a weak semantics for the offered representation constructs (see above). It 
should be noted that specifying the incompleteness on the semantic level is 
by no means a trivial task. Not only incompleteness issues are important in 
this context. For instance, the theoretical background for giving a semantics 
for rule-based computations was only investigated recently [Donini et ai, 
1992b; 1994a; 1998a]. 

Incomplete reasoning facilities might lead to unexpected behavior. We 
demonstrate by an example that incomplete inference algorithms can have 
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effects in situations a user might not be aware of. Loom also supports 
closed-world reasoning. The strategy for closing a role for an individual 
is to count the number of known role fillers. However, in addition to the 
individuals explicitly mentioned in the ABox, existential quantifications 
and minimum number restrictions have to be considered. Assuming too 
few of these individuals might result in an inconsistency. This is demon- 
strated by a simple knowledge base example with the following ABox: 
{(3R.A n 3R.B n 3 R.C)(i), R(i,j)}. Let us assume that in the TBox there 
exist axioms such that A is implicitly declared as disjoint from both con- 
cepts, B and C. In the Loom system, specific reasoning techniques (e.g., 
a technique called “conditioning” [MacGregor, 1991b]) are implemented to 
compute the number of necessary fillers. Closing the role R for i by adding 
(< 1 R)(i) makes the ABox inconsistent. However, since Loom is incom- 
plete, it might be the case that the disjointness of A and B as well as A 
and C is not detected and, therefore, too few fillers are assumed to ex- 
ist in the closing process. Thus, the added maximum number restriction 
might be too restrictive, i.e. , the system is unsound if closed- world rea- 
soning is employed. Note that the semantic basis of automatic closing of 
roles as offered by Loom is hard to characterize for expressive representa- 
tion languages. Obviously, closing the role R for i with (< 2 R)(i) might be 
a candidate. However, closing the role R for i with (< 3 R)(i) might also 
be possible. In this case we have more individuals but with less specific 
constraints. 


Back and Flex 

Research on Back (Berlin Advanced Computational Knowledge represen- 
tation system) started in 1985, approximately at the same time as work 
on the Loom system was initiated. Back was also called a knowledge rep- 
resentation environment [Quantz and Kindermann, 1990; Peltason, 1991; 
Hoppe et al., 1993]. 

The Description Logic of the initial Back system can be called ABORT 1 . 
There was also support for reasoning with numbers and attribute sets. Re- 
search on the inference algorithms for the basic Back language stimulated 
the development of theoretical results on the complexity of concept con- 
sistency reasoning (e.g., [Nebel, 1988; 1990a]) and the semantics of cycles 
[Nebel, 1991]. Additionally, not only was terminological reasoning consid- 
ered but an investigation was made into the development of a hybrid ar- 
chitecture consisting of a TBox and an ABox. Issues of integration and 
balancing in hybrid knowledge representation systems, namely balanced 
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expressiveness and tight coupling in hybrid systems, were analyzed in [Nebel 
and von Luck, 1987; 1988]. Research on the Back system helped to shape 
the current view on balanced representation schemes with TBox and ABox. 
In order to provide an hybrid representation language, Back was one of 
the first systems in which TBox concept terms could also be used in an 
ABox to assert, e.g., disjunctive information about individuals. In addition, 
distinct individuals were assumed to denote distinct objects. Hence, the 
number of role fillers could be counted and compared against number re- 
strictions (this was also done in Krypton as pointed out by [Woods and 
Schmolze, 1992, p. 165]). The algorithms used in Back for instance check- 
ing and instance retrieval are described in [Nebel and von Luck, 1987; 1988; 
Kindermann and Randi, 1990]. In general, the discussion of the problems 
of incomplete algorithms that was sketched in the account of Loom also 
applies to the Back system because the inference algorithms used in Back 
are also known to be incomplete. 

In order to provide a knowledge representation environment, the Back 
architecture was designed to support incremental additions to the ABox. 
Back was one of the first attempts to implement algorithms for reasoning 
about retractions of ABox assertions. Back supported retraction of told 
information, also called literal retraction [Nebel, 1990a; Kindermann, 1992]. 
This is also supported in the Loom system. ABox assertions can be retrieved 
from a database by automatically computing SQL queries [Schmiedel, 1993] . 
For the applications considered in the Back project, reasoning about time 
was important. Therefore, an integration of temporal reasoning and termi- 
nological reasoning was investigated by several project members. Investiga- 
tions into how to incorporate temporal reasoning into terminological rea- 
soning are reported in [Schmiedel, 1988; 1990; Schild, 1993; Fischer, 1992; 
Neuwirth, 1993]. 

In the successor system Flex [Quantz et al, 1995], incomplete algorithms 
were implemented for the Description Logic ACCQIZTTO . Additionally, rea- 
soning about equations and inequalities concerning integers was supported. 
Furthermore, the Flex system served as a testbed for investigating so-called 
weighted defaults [Quantz and Royer, 1992]. The initial implementation of 
Flex was developed in Prolog. Flex++ was a reimplenrentation in C++. 
The implementation was faster, but for application knowledge bases the per- 
formance was not sufficient. Appropriate optimization techniques (see the 
next chapter) had not been investigated in the context of Description Logics 
at the time of the development of the Flex implementation. 

In general, it is quite difficult to compare different systems and knowl- 
edge representation environments because the services being offered and 
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the representation languages are not standardized (see [Patel-Schneider and 
Swartout, 1993] for a proposal on standardizing representation languages 
and inference services). Experiences with system implementations indicated 
that either limited expressiveness or incompleteness of reasoning could pos- 
sibly lead to problems in applications. Therefore, other researchers inves- 
tigated the implementation of systems based on sound and complete algo- 
rithms (published at the end of the 1980s and beginning of the 1990s). One 
can consider [Schmidt-Schaufi and Srnolka, 1991] as a starting point of this 
development (see also [Donini et al, 1991a]). Based on tableau calculi, prac- 
tical DL implementations were developed. We discuss the architectures of 
the systems Kris and Crack. 


Kris 

The development of sound and complete reasoning systems for more ex- 
pressive Description Logics started at the end of the 1980s. One of the 
main developments in this direction was the system Kris. The approach 
of Kris was to implement sound and complete algorithms for an expres- 
sive Description Logic and to develop optimization techniques for TBox 
reasoning so that, in practice, reasonable performance could be expected. 
The Description Logic of Kris is ACCAftF [Baader and Hollunder, 1991a; 
1991b]. As an addition, Kris provides enumerated types (O operator) and 
an experimental interface for reasoning about so-called concrete domains 
[Baader and Hanschke, 1991a; 1991b; 1992] (e.g., linear inequalities over 
the reals). Role conjunctions were supported with a prototype implementa- 
tion. The focus of the work in the Kris project was on TBox classification. 
Nevertheless, Kris was one of the first systems also supporting sound and 
complete ABox reasoning in expressive Description Logics. Even multiple 
ABoxes could be handled. The implementation language of Kris was Com- 
monLisp (see [Hollunder et al, 1991] for a User’s Guide and [Achilles et al, 
1991] for a description of the graphical user interface). 

The idea behind optimizing TBox classification was to exploit “obvi- 
ous” information concerning “told” superconcepts and primitive concepts. 
In many concept definitions of application knowledge bases the right-hand 
side is a conjunction with concept names and concept terms. The con- 
juncts which are concept names on the right-hand side are defined as 
the “told” subsumers. Another important point was to avoid recompu- 
tation of subsumption relations found in preceding computation steps. 
Thus, caching and propagation techniques were implemented. The idea 
was that information can be propagated in the subsumption lattice such 
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that expensive subsumption tests can be avoided where possible. Kris 
was the first system for which systematic empirical tests were carried 
out. The algorithms evaluated in [Baader et al, 1992b; 1994] are still 
in use in modern DL systems (see below). Extensions such as defaults 
were investigated as well (see also [Baader and Hollunder, 1992; 1993; 
Hollunder, 1994a]) but have not been implemented in Kris. 

Although the benchmarks considered in [Baader et al, 1994] revealed 
that the performance of Kris for TBox reasoning was comparable to that 
of other systems of that time, the more or less direct implementation of 
nondeterministic tableau algorithms that were developed for proving the 
decidability of problems in the field of theoretical computer science with 
chronological backtracking as in Kris led to performance problems for many 
applications. One of the main results of the Kris project was that sound and 
complete inference algorithms are an important starting point for research on 
optimized sound and complete algorithms for practical system development. 

Crack 

One of the main research goals of the system Crack was to implement 
sound and complete algorithms for dealing with inferences about individuals 
in concept terms. Rather than providing a non-standard semantics as in 
Classic (individuals are mapped onto sets of domain objects), in Crack, 
individuals are mapped to elements of the domain. Thus, coreferences also 
have to be considered in concept terms. Crack supports the Description 
Logic ACCIZTIFO [Bresciani et al, 1995]. The implementation of Crack is 
based on CommonLisp. Crack provided a web interface. 

In a similar way as in Kris, obvious information is exploited in the archi- 
tecture to some extent but, nevertheless, Crack is a direct implementation 
of the tableau rules of the underlying calculus. At the beginning of the 1990s 
it became clear that sound and complete reasoning is needed for many ap- 
plications but the inference techniques employed, which had been developed 
for (manually) deriving decidability results, e.g., with tableau algorithms, 
were not suited for direct implementation. Thus, it was realized that there 
is a long way to go from a decidability proof to a working system that has 
good performance in the average case. 


Other systems 

The list of systems we have discussed in this chapter is certainly incomplete. 
The large number of projects involved in the development of knowledge 
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representation systems shows the importance of this area. Usually DL sys- 
tems are built around a core engine which is a consistency checker. However, 
there are other services to be supplied which are also important to make the 
systems usable in larger application projects. We present an overview of some 
additional systems with interesting features developed at the beginning of 
the 1990s. 

Among other points, the graphical manipulation of representations was 
investigated in the Sb-One project [Allgayer, 1990; Kobsa, 1991b; 1991a]. 
The implementation language was CommonLisp. Techniques for graphical 
interfaces to support knowledge base development with Sb-One are de- 
scribed in [Kalrnes, 1988; 1990] (see also [Abrett and Burstein, 1987] for a 
description of the Kreme system). Furthermore, in Sb-One the use of con- 
texts (also called partitions) was explored for user modeling applications in 
natural language generation. 

Another important point for DL inference systems is persistence and 
transaction management. We have already discussed the Back approach 
[Schmiedel, 1993] (see also [Borgida, 1995]). Additional investigations were 
also made with the K-Rep system [Mays et al, 1991a; 1991b]. 


Summary: standard inference services of Description 
Logics systems 

Before discussing successors of the second generation systems presented in 
this section it is appropriate to summarize the main inference problems 
that are now assumed as standard for DL systems. The inference services 
provided by DL systems for concept consistency and TBox reasoning can be 
summarized as follows: 

• Concept consistency (w.r.t. a TBox). 

• Concept subsumption (w.r.t. a TBox). 

• Another important inference service for practical knowledge representation is to 
check whether a certain concept name is inconsistent w.r.t. a TBox. Usually, 
inconsistent concept names are the consequence of modeling errors. Checking the 
consistency of all concept names mentioned in a TBox without computing the 
parents and children is called a TBox coherence check. 

• The problem of computing the most specific concept names mentioned in a TBox 
that subsume a certain concept is known as computing the parents of a concept. 
The children are the most general concept names mentioned in a TBox that 
are subsumed by a certain concept. We use the name concept ancestors ( concept 
descendants) for the transitive closure of the parents (children) relation. The 
computation of the parents and children of every concept name is also called 
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classification of the TBox. This inference is needed to build a hierarchy of concept 
names w.r.t. specificity and is known as TBox classification. 

If a system supports ABox reasoning, the following inference services are 
provided: 

• ABox consistency (w.r.t. a TBox). 

• Instance checking w.r.t. a TBox and an ABox. 

• The most specific concept names mentioned in a TBox T of which an individual 
is an instance are called the direct types of the individual w.r.t. a TBox and an 
ABox. 

• The retrieval inference problem is to find all individuals mentioned in an ABox 
that are an instance of a given concept C w.r.t. a TBox. 

• The set of fillers of a role R for an individual i w.r.t. a TBox T and an ABox A 
is defined as {x | (T, A) |= (i,x) : R} where (T, A) |= ax means that all models 
of T and A are also models of ax. 

• The set of roles between two individuals i and j w.r.t. a knowledge base (T, A) 
is defined as {R \ ( T,A ) |= (i,j) : R}. 

In many DL systems, there are some auxiliary queries supported: retrieval 
of the concept names or individuals mentioned in a knowledge base, retrieval 
of the set of roles, retrieval of the role parents and children (defined analo- 
gously to the concept parents and children, see above), retrieval of the set 
of individuals in the domain and in the range of a role, etc. As we have 
discussed in this section, DL systems of the second generation offer most if 
not all of these inference services. An exception is a language for specify- 
ing retrieval queries that goes beyond the simple retrieval inference problem 
mentioned above (see e.g., the discussion about Loom). 


8.4 The next generation: FaCT, Dlp and Racer 

The declarative nature of DL modeling is even more important when 
problems are treated for which languages are required that are no longer 
tractable. Inspired by theoretical advances - e.g., for handling number re- 
strictions, role conjunctions, generalized concept inclusions and cyclic ax- 
ioms with descriptive semantics ( ACCNlZ [Buchheit et al, 1993a]); transi- 
tive roles (ACC } ?+ [Sattler, 1996]); role hierarchies and features (ACCfi/nr 
[Horrocks, 1998b]); and inverse roles, qualified number restrictions, and role 
hierarchies (SUXQ [Horrocks et al., 1999] also called ACCQH1r + , pro- 
nounced ALC-choir) - the development of another generation of sound and 
complete DL systems was started at the end of the 1990s. 
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Initially, research on practical implementations of DL systems for expres- 
sive Description Logics started with a focus on concept and TBox reason- 
ing. However, rather than directly implementing the tableau calculus used 
for the theoretical decidability proofs and complexity analyses, a rigorous 
investigation into methods for informed search was made for developing 
the next generation of DL systems. In particular, average-case optimization 
techniques have been investigated with the system FaCT ([Horrocks, 1997a; 
1998b; Horrocks and Patel-Schneider, 1999] see also the subsequent chapter 
for details). At the time of this writing, two versions of FaCT are available. 
One version supports TBox reasoning for the Description Logic ACCHfn+ 
[Horrocks, 1997a; 1998b]. Furthermore, a newer version of FaCT also sup- 
ports TBox reasoning with inverse roles and qualified number restrictions 
(< STL1Q [Horrocks, 1999; Horrocks et al, 1999]). At the time of this writing, 
FaCT does not support ABoxes. 

It was the FaCT system that first demonstrated the usefulness of expres- 
sive Description Logics for developing practical applications. It was shown 
that, although runtime behavior can be exponential in the worst case, in 
practical contexts, optimization techniques can be found that prevent a DL 
system from running into combinatorial explosion. Nevertheless, the algo- 
rithms are still sound and complete. Indeed, after several years of experience 
with less expressive systems such as Classic, research on FaCT stimulated 
many research activities for developing optimized DL system implementa- 
tions for expressive Description Logics. 

The system FaCT is implemented in CommonLisp and can be down- 
loaded with source code for research purposes. A CORBA interface guar- 
antees seamless integration into network- aware applications. Various input 
formats are supported by FaCT (e.g., for XML-based notations of TBoxes). 
The graphical interface OilEd for developing TBoxes in the spirit of frame 
systems is described in [Bechhofer et al., 2001b]. 


Dlp 

Based on similar techniques to FaCT, the system Dlp utilizes extended 
techniques for optimizations [Horrocks and Patel-Schneider, 1998c; 1998d; 
Patel-Schneider, 1999]. Dlp supports concept consistency reasoning for the 
Description Logic ACCM reg . From a modal logic perspective, ACCM reg can 
also be called Propositional Dynamic Logic (PDL) with a restricted form of 
graded modalities, i.e., simple number restrictions. 
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Dlp has succeeded in many performance competitions [Horrocks, 1998a; 
Horrocks and Patel-Schneider, 1998c; Patel-Schneider, 1999]. It was shown 
that tableau-based approaches can be implemented such that the perfor- 
mance for satisfiability testing for ACC or modal logic K m is comparable to 
traditional approaches used in the community [Giunchiglia and Sebastiani, 
1996b; Giunchiglia et al . , 1999]. 

However, in the current version of Dlp TBox classification is not provided 
as an inference service. In particular, no generalized concept inclusions and 
no TBoxes with forward references are supported (i.e. , algorithms for dealing 
with generalized concept inclusions are not implemented in Dlp). ABoxes 
are not supported either. Dlp is implemented in SML. 


Racer 

For many applications, besides concept consistency and TBox reason- 
ing, ABox reasoning is also important. Calculi for ABox consistency 
have been presented for the above-mentioned representation constructs: 
ACCMlZ [Buchheit et al . , 1993b], ACCAIHr+ [Haarslev and Moller, 2000], 
ACCQHIr+ ( SHTQ ) [Horrocks et al, 2000c]. Based on theoretical re- 
sults, a practical implementation of ABox calculi was developed with 
the full TBox and ABox DL system Racer [Haarslev and Moller, 1999; 
2001e]. Racer supports all optimization techniques that are incorporated 
into FaCT. Some new optimization techniques investigated with the Racer 
system (e.g., for dealing with number restrictions and ABoxes) are men- 
tioned in the next chapter. In Racer, the unique name assumption for ABox 
individuals is imposed. In order to demonstrate the usefulness of DL systems 
for practical applications, high performance reasoning for large TBoxes is 
discussed in [Haarslev and Moller, 2001c]. 

Initial versions of the Racer system supported the logic ACCNTIr+. 
In later versions reasoning was extended to ABox reasoning with the logic 
ACCQT12r+ ( STilQ ). In addition, Racer supports concrete domains with- 
out so-called feature chains (see [Baader and Hanschke, 1991a] and the dis- 
cussion of the Kris system). In particular, predicates representing linear in- 
equalities about the reals are handled by Racer (see [Haarslev et al, 2001; 
Haarslev and Moller, 2001b] for details). 

Racer dynamically selects appropriate optimization techniques based on 
a static analysis of input TBoxes, ABoxes and queries. As a distinguish- 
ing feature, which is important for many applications, it should also be 
mentioned that Racer supports multiple TBoxes and ABoxes (see also the 
Kris system). Assertions can be added to ABoxes after queries have been 
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answered. In addition, for instance, Racer also provides support for retrac- 
tion of assertions. 

Racer can be downloaded for research purposes as a server program for 
standard operating systems with no additional licenses. A socket-based net- 
work version with Java interface is available. The implementation language 
of Racer is CommonLisp. 


8.5 Lessons learned 

Considering the evolving technology of DL systems it becomes clear that 
since the end of the 1990s there has been an enormous interest in DL rea- 
soning systems. This is demonstrated by the quite large number of system 
implementations. Currently, all modern DL systems are based on sound 
and complete algorithms. Thus, system developers can really rely on all 
answers computed by a DL system. This positive trend has been initiated 
by the development of optimization techniques that ensure stable runtimes 
for average-case inputs for real-world problems even if the worst-case com- 
plexity is exponential (see also below). The trend has been initiated by the 
landmark system FaCT. 

The original idea of the Tell&Ask interface of Krypton is still realized 
in modern systems. However, at the time of this writing, the systems sup- 
port only some kind of batch-oriented behavior. A knowledge base (TBox 
and ABox) is passed to the systems (tell interface). Afterwards, queries can 
be answered (ask interface). But, no incremental additions to the knowl- 
edge base are possible after the first query is answered. The difficulty is 
that complex transformations on the knowledge bases are necessary in or- 
der to compute an internal representation that can be used for relatively 
fast query answering (see the discussion on optimization techniques in sub- 
sequent chapters). The price to pay is that algorithms for appropriately 
handling incremental additions to a knowledge base are not yet known. 
Other features, e.g., explanation facilities, retraction, etc., still have to be 
developed for expressive DLs as well. 

As a second and quite important lesson one can see that Description 
Logics with more expressiveness and sound and complete algorithms impose 
a different view in modeling. Concept definitions as known from, for instance, 
Classic are no longer the central modeling device if generalized concept 
inclusions (representing cyclic implications or equalities) are available. 6 

A third lesson we can learn from considering DL systems and their de- 


Nevertheless, Description Logics can still be called object-based representation formalisms, 
although there are some approaches to deal with n-ary relations [Schmolze, 1989; Calvanese 
et al ., 1998d] as well. 
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velopment is that speed depends much more on the expressiveness of the 
Description Logic than it does on the implementation language. What re- 
ally counts is the set of optimization strategies, the implementation of index 
data structures and the selection of clever heuristics. There are first attempts 
at providing a distributed implementation of a DL system. However, perfor- 
mance problems in network communication lead to server-based solutions, 
i.e., a knowledge base is being processed at a single workstation computer 
(but may be accessed from different clients). Benchmark generators and 
standardized application knowledge bases are used for metering system per- 
formance. Thus, different system implementations can be compared. 

With Racer we have discussed a state-of-the-art DL system that also 
supports ABoxes and concrete domains. However, only simple query lan- 
guages are currently available. For Description Logics without inverse roles 
and number restrictions (i.e., AjC,CHfn+), Tessaris [2001] developed the basic 
techniques for supporting the so-called conjunctive queries in DL systems. 
However, for Description Logics as expressive as STCLQ much less is known 
from an implementation standpoint. 

Another lesson is that the development of techniques for practically incor- 
porating facilities for the representation of space and time into Description 
Logics is still an open issue. The necessity of a semantics-based integration 
of temporal and terminological reasoning has been emphasized in first inves- 
tigations in the Back project. However, early approaches (e.g., [Schmiedel, 
1990]) have been shown to be undecidable [Halpern and Shoharn, 1991; 
Schild, 1993]. In the context of planning, the opportunities of an inte- 
grated environment combining temporal and terminological reasoning were 
clearly demonstrated with the RHET system [Allen, 1991]. It has been 
shown that spatial reasoning (e.g., about topological relations) induces non- 
obvious subsumption relationships between concepts [Haarslev et al, 1998; 
1999]. The work presented in [Artale et al, 2001] demonstrates that the 
decidability barrier is reached if temporal operators are integrated into ex- 
pressive Description Logics. Nevertheless, [Artale et al, 2001] identify a 
fragment that allows a limited kind of practical modeling. Initial experi- 
ments concerning an implementation of a Description Logic that supports 
operators for linear time temporal reasoning are discussed in [Giinsel and 
Wittmann, 2001]. 
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Abstract 

This chapter will discuss the implementation of the reasoning services which 
form the core of DL-based knowledge representation systems. To be useful 
in realistic applications, such systems need both expressive logics and fast 
reasoners. As expressive logics inevitably have high worst-case complexities, 
this can only be achieved by employing highly optimized implementations 
of suitable reasoning algorithms. Systems based on such implementations 
have demonstrated that they can perform well with problems that occur 
in realistic applications, including problems where unoptimized reasoning is 
hopelessly intractable. 


9.1 Introduction 

The usefulness of Description Logics in applications has been hindered by 
the basic conflict between expressiveness and tractability. Realistic appli- 
cations typically require both expressive logics, with inevitably high worst- 
case complexities for their decision procedures, and acceptable performance 
from the reasoning services. Although the definition of acceptable may vary 
widely from application to application, early experiments with Description 
Logics indicated that, in practice, performance was a serious problem, even 
for logics with relatively limited expressive powers [Heinsohn et al., 1992]. 

On the other hand, theoretical work has continued to extend our un- 
derstanding of the boundaries of decidability in Description Logics, and 
has led to the development of sound and complete reasoning algorithms 
for much more expressive logics. The expressive power of these logics 
goes a long way towards addressing the criticisms leveled at Description 
Logics in traditional applications such as ontological engineering [Doyle 
and Patil, 1991] and is sufficient to suggest that they could be useful 


329 



330 


I. Horrocks 


in several exciting new application domains, for example reasoning about 
database schemas and queries [Calvanese et al, 1998f; 1998a] and provid- 
ing reasoning support for the so-called Semantic Web [Decker et al., 2000; 
Bechhofer et al., 2001b]. However, the worst-case complexity of their deci- 
sion procedures is invariably (at least) exponential with respect to problem 
size. 

This high worst-case complexity initially led to the conjecture that ex- 
pressive Description Logics might be of limited practical applicability [Buch- 
heit et al., 1993c]. However, although the theoretical complexity results are 
discouraging, empirical analyses of real applications have shown that the 
kinds of construct which lead to worst-case intractability rarely occur in 
practice [Nebel, 1990b; Heinsohn et al., 1994; Speel et al., 1995], and ex- 
periments with the Kris system showed that applying some simple opti- 
mization techniques could lead to a significant improvement in the empir- 
ical performance of a DL system [Baader et al., 1992a]. More recently the 
FaCT, Dlp and Racer systems have demonstrated that, even with very 
expressive logics, highly optimized implementations can provide acceptable 
performance in realistic applications [Horrocks and Patel-Schneider, 1999; 
Haarslev and Moller, 2001c]. 1 

In this chapter we will study the implementation of DL systems, exam- 
ining in detail the wide range of optimization techniques that can be used 
to improve performance. Some of the techniques that will be discussed are 
completely independent of the logical language supported by the Description 
Logic and the kind of algorithm used for reasoning; many others would be 
applicable to a wide range of languages and implementation styles, partic- 
ularly those using search-based algorithms. However, the detailed descrip- 
tions of implementation and optimization techniques will assume, for the 
most part, reasoning in an expressive DL based on a sound and complete 
tableau algorithm. 


9.1.1 Performance analysis 

Before designing and implementing a DL-based knowledge representation 
system, the implementor should be clear about the goals that they are trying 
to meet and against which the performance of the system will ultimately be 
measured. In this chapter it will be assumed that the primary goal is utility 
in realistic applications, and that this will normally be assessed by empirical 
analysis. 

1 It should be pointed out that experience in this area is still relatively limited. 
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Unfortunately, as DL systems with very expressive logics have only re- 
cently become available [Horrocks, 1998a; Patel-Schneider, 1998; Haarslev 
and Moller, 2001e], there are very few applications that can be used as a 
source for test data. 2 One application that has been able to provide such data 
is the European Galen project, part of which has involved the construc- 
tion of a large DL knowledge base describing medical terminology [Rector 
et al., 1993]. Reasoning performance with respect to this knowledge base has 
been used for comparing DL systems [Horrocks and Patel-Schneider, 1998b], 
and we will often refer to it when assessing the effectiveness of optimization 
techniques. 

As few other suitable knowledge bases are available, the testing of DL 
systems has often been supplemented with the use of randomly generated or 
hand crafted test data [Giunchiglia and Sebastiani, 1996b; Heuerding and 
Schwendimann, 1996; Horrocks and Patel-Schneider, 1998b; Massacci, 1999; 
Donini and Massacci, 2000]. In many cases the data was originally developed 
for testing propositional modal logics, and has been adapted for use with 
Description Logics by taking advantage of the well-known correspondence 
between the two formalisms [Schild, 1991]. Tests using this kind of data, in 
particular the test suites from the Tableaux’98 comparison of modal logic 
theorem provers [Balsiger and Heuerding, 1998] and the DL’98 comparison 
of DL systems [Horrocks and Patel-Schneider, 1998b], will also be referred 
to in assessments of optimization techniques. 


9.2 Preliminaries 

This section will introduce the syntax and semantics of Description Logics 
(full details of which can be found in Chapter 2) and discuss the reasoning 
services which would form the core of a DL-based knowledge representation 
system. It will also discuss how, through the use of unfolding and inter- 
nalization, these reasoning services can often be reduced to the problem of 
determining the satisfiability of a single concept. 


9.2.1 Syntax and semantics 

Description Logics are formalisms that support the logical description of 
concepts and roles. Arbitrary concept and role descriptions (from now on 
referred to simply as concepts and roles) are constructed from atomic con- 
cept and role names using a variety of concept- and role-forming operators, 

2 This situation is changing rapidly, however, with the increasing use of Description Logics in 
database and ontology applications. 
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the range of which is dependent on the particular logic. In the following 
discussion we will use C and D to denote arbitrary concepts, R and S to 
denote arbitrary roles, A and P to denote atomic concept and role names, 
and n to denote a nonnegative integer. 

For concepts, the available operators usually include some or all of the 
standard logical connectives, conjunction (denoted n), disjunction (denoted 
U) and negation (denoted ->). In addition, the universal concept top (denoted 
T, and equivalent to A U -> A) and the incoherent concept bottom (denoted 
_L, and equivalent to in — i A) are often pre-defined. Other commonly sup- 
ported operators include restricted forms of quantification called existential 
role restrictions (denoted 3R.C) and universal role restrictions (denoted 
Vi?. C). Some Description Logics also support qualified number restrictions 
(denoted ^ n .PC and ^ n .PC'), operators that place cardinality restrictions 
on the roles relating instances of a concept to instances of some other con- 
cept. Cardinality restrictions are often limited to the forms ^n.PT and 
fin. PT, when they are called unqualified number restrictions, or simply 
number restrictions, and are often abbreviated to ^ n P and ^ nP. The roles 
that can appear in cardinality restriction concepts are usually restricted to 
being atomic, as allowing arbitrary roles in such concepts is known to lead 
to undecidability [Baader and Sattler, 1996b]. 

Role-forming operators may also be supported, and in some very expres- 
sive logics roles can be regular expressions formed using union (denoted U), 
composition (denoted o), reflexive-transitive closure (denoted *) and identity 
operators (denoted id), possibly augmented with the inverse (also known as 
converse ) operator (denoted _ ) [De Giacomo and Lenzerini, 1996]. In most 
implemented systems, however, roles are restricted to being atomic names. 

Concepts and roles are given a standard Tarski-style model-theoretic se- 
mantics, their meaning being given by an interpretation I = (A 1 ), where 

A x is the domain (a set) and x is an interpretation function. Full details of 
both syntax and semantics can be found in Chapter 2. 

In general, a DL knowledge base (KB) consists of a set T of terminological 
axioms, and a set A of assertional axioms. The axioms in T state facts about 
concepts and roles while those in A state facts about individual instances 
of concepts and roles. As in this chapter we will mostly be concerned with 
terminological reasoning, that is reasoning about concepts and roles, a KB 
will usually be taken to consist only of the terminological component T. 

A terminological KB T usually consists of a set of axioms of the form C C 
D and C = D, where C and D are concepts. An interpretation X satisfies T if 
for every axiom {C C D) € T , C x C D x , and for every axiom (C = D) € T , 
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C x = D x : Z is satisfiable if there exists some non-empty interpretation that 
satisfies it. Note that T can, without loss of generality, be restricted to 
contain only inclusion axioms or only equality axioms, as the two forms can 
be reduced one to the other using the following equivalences: 

CCD <*=*► TeDUiC 
C = D CCDandDCC. 

A concept C is subsumed by a concept D with respect to T (written 
T J= C CC D) if C x C D x in every interpretation Z that satisfies T, a con- 
cept C is satisfiable with respect to T (written T \= C % _L) if C x 0 in 
some Z that satisfies T, and a concept C is unsatisfiable (not satisfiable) 
with respect to T (written Z |= -> C) if C x = 0 in every Z that satisfies Z. 
Subsumption and (un)satisfiability are closely related. If Z |= C CC D, then 
in all interpretations Z that satisfy Z. C x C D x and so C x n (-> D) x = 0. 
Conversely, if C is not satisfiable with respect to Z . then in all Z that sat- 
isfy Z, C 1 = 0 and so C x C l_ x . Subsumption and (un) satisfiability can thus 
be reduced one to the other using the following equivalences: 

Z\=CQD ^ Z |= -.(C n -.D) 

Z |= -i (7 ««=► T|=CCA. 

In some Description Logics Z can also contain axioms that define a set of 
transitive roles R+ and/or a subsumption partial ordering on roles [Horrocks 
and Sattler, 1999]. An axiom R e R + states that R is a transitive role while 
an axiom DCS states that R is subsumed by S. An interpretation Z satisfies 
the axiom R £ R + if R x is transitively closed (i.e. , ( R x ) + = R x ), and it 
satisfies the axiom DCS if R x C S x . 


9.2.2 Reasoning services 

Terminological reasoning in a DL-based knowledge representation system 
is based on determining subsumption relationships with respect to the 
axioms in a KB. As well as answering specific subsumption and satisfiability 
queries, it is often useful to compute and store (usually in the form of a 
directed acyclic graph) the subsumption partial ordering of all the concept 
names appearing in the KB, a procedure known as classifying the KB 
[Patel-Schneider and Swartout, 1993]. Some systems may also be capable 
of dealing with assertional axioms, those concerning instances of concepts 
and roles, and performing reasoning tasks such as realization (determining 
the concepts instantiated by a given individual) and retrieval (determining 
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the set of individuals that instantiate a given concept) [Baader et al, 
1991]. However, we will mostly concentrate on terminological reasoning 
because it has been more widely used in DL applications. Moreover, given 
a sufficiently expressive Description Logic, assertional reasoning can be 
reduced to terminological reasoning [De Giacomo and Lenzerini, 1996]. 

In practice, many systems use subsumption-testing algorithms that are 
not capable of determining subsumption relationships with respect to an 
arbitrary KB. Instead, they restrict the kinds of axiom that can appear in 
the KB so that dependency-eliminating substitutions (known as unfolding ) 
can be performed prior to evaluating subsumption relationships. These re- 
strictions require that all axioms are unique , acyclic definitions. An axiom 
is called a definition of A if it is of the form A C D or A = D, where A is an 
atomic name, it is unique if the KB contains no other definition of A, and it 
is acyclic if D does not refer either directly or indirectly (via other axioms) 
to A. A KB that satisfies these restrictions will be called an unfoldable KB. 

Definitions of the form ACD are sometimes called primitive or necessary , 
as D specifies a necessary condition for instances of A, while those of the form 
A = D are sometimes called non-primitive or necessary and sufficient as D 
specifies conditions that are both necessary and sufficient for instances of A. 
In order to distinguish non-definitional axioms, they are often called general 
axioms [Buchheit et al., 1993a]. Restricting the KB to definition axioms 
makes reasoning much easier, but significantly reduces the expressive power 
of the DL. However, even with an unrestricted (or general ) KB, definition 
axioms and unfolding are still useful ideas, as they can be used to optimize 
the reasoning procedures (see Subsection 9.4.3). 


9.2.3 Unfolding 

Given an unfoldable KB T, and a concept C whose satisfiability is to be 
tested with respect to T, it is possible to eliminate from C all concept names 
occurring in T using a recursive substitution procedure called unfolding. 
The satisfiability of the resulting concept is independent of the axioms in T 
and can therefore be tested using a decision procedure that is only capable 
of determining the satisfiability of a single concept (or equivalently, the 
satisfiability of a concept with respect to an empty KB). 

For a non-primitive concept name A, defined in T by an axiom A = D, the 
procedure is simply to replace A with D wherever it occurs in C, and then 
to recursively unfold D. For a primitive concept name A, defined in T by an 
axiom A CD, the procedure is slightly more complex. Wherever A occurs 
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in C it is replaced with the concept A 7 n D, where A 7 is a new concept name 
not occurring in X or C , and D is then recursively unfolded. The concept 
A 7 represents the “primitiveness” of A - the unspecified characteristics that 
differentiate it from D. We will use Unfold(C', X) to denote the concept C 
unfolded with respect to a KB X. 

A decision procedure that tries to find a satisfying interpretation X for 
the unfolded concept can now be used, as any such interpretation will also 
satisfy X. This can easily be shown by applying the unfolding procedure 
to all of the concepts forming the right-hand side of axioms in X, so that 
they are constructed entirely from concept names that are not defined in X, 
and are thus independent of the other axioms in X. The interpretation of 
each defined concept in X can then be taken to be the interpretation of the 
unfolded right-hand side concept, as given by X and the semantics of the 
concept and role forming operators. 

Subsumption reasoning can be made independent of X using the same 
technique. Given two concepts C and D, determining if C is subsumed by 
D with respect to X is the same as determining if Unfold(C, X) is subsumed 
by Unfold(X,X) with respect to an empty KB: 

T\=C AD <=► 0 |= Unfold(C,X) C Unfold(£>, X). 

Unfolding would not be possible, in general, if the axioms in X were 
not unique acyclic definitions. If X contained multiple definition axioms 
for some concept A, for example {(A = C), (A = D)} C X, then it would 
not be possible to make a substitution for A that preserved the meaning 
of both axioms. If X contained cyclic axioms, for example (A A 3R. A) £ 
X, then trying to unfold A would lead to non-termination. If X contained 
general axioms, for example 3R.C A D, then it could not be guaranteed 
that an interpretation satisfying the unfolded concept would also satisfy 
these axioms. 


9.2.4 Internalization 

While it is possible to design an algorithm capable of reasoning with respect 
to a general KB [Buchheit et al., 1993a], with more expressive logics, in 
particular those allowing the definition of a universal role , a procedure called 
internalization can be used to reduce the problem to that of determining 
the satisfiability of a single concept [Baader, 1991]. A truly universal role 
is one whose interpretation includes every pair of elements in the domain 
of interpretation (i.e. , A x x A x ). However, a role U is universal w.r.t. a 
terminology X if it is defined such that U is transitively closed and PCI/ 
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for all role names P occurring in T . For a logic that supports the union 
and transitive reflexive closure role- forming operators, this can be achieved 
simply by taking U to be 

(PiU-.UPnUPj-U-UP-)*, 

where Pi, . . . , P n are all the roles names occurring in T . For a logic that sup- 
ports transitively closed roles and role inclusion axioms, this can be achieved 
by adding the axioms 


(U € R+), (Pi E C/),... , (Pn E U), (Pr E U), . . . , (PE E U) 

to T, where Pi, . . . , P n are all the roles names occurring in T and U is a 
new role name not occurring in T. Note that in either case, the inverse role 
components are only required if the logic supports the inverse role operator. 

The concept axioms in T can be reduced to axioms of the form T E C 
using the equivalences: 

A = B TE(AU -iB) n (-iA U B) 

A\ZB <*=*► TEtAuB. 

These axioms can then be conjoined to give a single axiom TEC*, where 

C= I - ! ((Ai u -.Sj) n (^Ai u BA) n IH (-.Tufi,). 

(A,=B0er (AjQBj)eT 

Because the interpretation of T is equal to the domain (T 1 = A 2 "), this 
axiom states that every element in the domain must satisfy C. When testing 
the satisfiability of a concept D with respect to T, this constraint on possible 
interpretations can be imposed by testing the satisfiability of D n C n VU.C 
(or simply D n VC /. C in the case where U is transitively reflexively closed). 
This relies on the fact that satisfiable DL concepts always have an interpre- 
tation in which every element is connected to every other element by some 
sequence of roles (the collapsed model property) [Schild, 1991]. 


9.3 Subsumption-testing algorithms 

The use of unfolding and internalization means that, in most cases, termi- 
nological reasoning in a DL-based Knowledge Representation System can be 
reduced to subsumption or satisfiability reasoning. There are several algo- 
rithmic techniques for computing subsumption relationships, but they divide 
into two main families: structural and logical. 
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9.3.1 Structural subsumption algorithms 

Structural algorithms were used in early DL systems such as Kl-One 
[Brachnran and Schmolze, 1985], Nikl [Kaczmarek et ai, 1986] and Kryp- 
ton [Brachman et ai, 1983a], and are still used in systems such as Classic 
[Patel-Schneider et al . , 1991], Loom [MacGregor, 1991b] and Grail [Rec- 
tor et ai, 1997]. To determine if one concept subsumes another, structural 
algorithms simply compare the (normalized) syntactic structure of the two 
concepts (see Chapter 2). 

Although such algorithms can be quite efficient [Borgida and Patel- 
Schneider, 1994; Heinsohn et al . , 1994], they have several disadvantages: 

• Perhaps the most important disadvantage of this type of algorithm is that while 
it is generally easy to demonstrate the soundness of the structural inference rules 
(they will never infer an invalid subsumption relationship), they are usually in- 
complete (they may fail to infer all valid subsumption relationships). 

• It is difficult to extend structural algorithms in order to deal with more expres- 
sive logics, in particular those supporting general negation, or to reason with 
respect to an arbitrary KB. This lack of expressive power makes the DL system 
of limited value in traditional ontological engineering applications [Doyle and 
Patil, 1991], and completely useless in database schema reasoning applications 
[Calvanese et al., 1998f] . 

• Although accepting some degree of incompleteness is one way of improving the 
performance of a DL reasoner, the performance of incomplete reasoners is highly 
dependent on the degree of incompleteness, and this is notoriously difficult to 
quantify [Borgida, 1992a]. 


9. 3. 2 Logical algorithms 

These kinds of algorithm use a refutation-style proof: C is subsumed by 
D if it can be shown that the existence of an individual x that is in the 
extension of C (x E C x ) but not in the extension of D (x ^ D x ) is logically 
inconsistent. As we have seen in Subsection 9.2.2, this corresponds to testing 
the logical (un)satisfiability of the concept C n —>D (i.e. , C C D iff C n —> D 
is not satisfiable) . Note that forming this concept obviously relies on having 
full negation in the logic. 

Various techniques can be used to test the logical satisfiability of a concept. 
One obvious possibility is to exploit an existing reasoner. For example, the 
LogicsWorkbench [Balsiger et ai, 1996], a general-purpose propositional 
modal logic reasoning system, could be used simply by exploiting the well- 
known correspondences between description and modal logics [Schild, 1991]. 
First-order logic theorem p rovers can also be used via appropriate traslations 
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of Description Logics into first-order logic. Examples of this approach can be 
seen in systems developed by Hustadt and Schmidt [1997], using the Spass 
theorem prover, and Paramasivam and Plaisted [1998], using the CLIN-S 
theorem prover. An existing reasoner could also be used as a component of a 
more powerful system, as in Ksat/*Sat [Giunchiglia and Sebastiani, 1996a; 
Giunchiglia et al, 2002], where a propositional satisfiability (SAT) tester is 
used as the key component of a propositional modal satisfiability reasoner. 

There are advantages and disadvantages to the “re-use” approach. On 
the positive side, it should be much easier to build a system based on an 
existing reasoner, and performance can be maximized by using a state-of-the- 
art implementation such as Spass (a highly optimized first-order theorem 
prover) or the highly optimized SAT testing algorithms used in Ivsat and 
*Sat (the use of a specialized SAT tester allows *Sat to outperform other 
systems on classes of problem that emphasize propositional reasoning) . The 
translation (into first-order logic) approach has also been shown to be able to 
deal with a wide range of expressive Description Logics, in particular those 
with complex role-forming operators such as negation or identity [Hustadt 
and Schmidt, 2000]. 

On the negative side, it may be difficult to extend the reasoner to deal 
with more expressive logics, or to add optimizations that take advantage 
of specific features of the Description Logic, without reimplementing the 
reasoner (as has been done, for example, in more recent versions of the 
*Sat system). 

Most, if not all, implemented DL systems based on logical reasoning have 
used custom designed tableau decision procedures. These algorithms try to 
prove that D subsumes C by starting with a single individual satisfying 
C n -i D, and demonstrating that any attempt to extend this into a com- 
plete interpretation (using a set of tableau expansion rules ) will lead to a 
logical contradiction. If a complete and non-contradictory interpretation is 
found, then this represents a counterexample (an interpretation in which 
some element of the domain is in C x but not in D x ) that disproves the 
conjectured subsumption relationship. 

This approach has many advantages and has dominated recent DL re- 
search: 

• It has a sound theoretical basis in first-order logic [Hollunder et al., 1990]. 

• It can be relatively easily adapted to allow for a range of logical languages by 
changing the set of tableau expansion rules [Hollunder et al., 1990; Bresciani 
et al., 1995]. 

• It can be adapted to deal with very expressive logics, and to reason with respect 
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to an arbitrary KB, by using more sophisticated control mechanisms to ensure 
termination [Baader, 1991; Buchheit et al., 1993c; Sattler, 1996]. 

• It has been shown to be optimal for a number of DL languages, in the sense that 
the worst-case complexity of the algorithm is no worse than the known complexity 
of the satisfiability problem for the logic [Hollunder et al ., 1990]. 

In the remainder of this chapter, detailed descriptions of implementation 
and optimization techniques will assume the use of a tableau decision proce- 
dure. However, many of the techniques are independent of the subsumption- 
testing algorithm or could easily be adapted to most logic-based methods. 
The reverse is also true, and several of the described techniques have been 
adapted from other logical decision procedures, in particular those that try 
to optimize the search used to deal with non-determinism. 

9.3.2. 1 Tableau algorithms 

Tableau algorithms try to prove the satisfiability of a concept D by 
constructing a model , an interpretation Z in which D x is not empty. A 
tableau is a graph which represents such a model, with nodes corresponding 
to individuals (elements of A- 1 ) and edges corresponding to relationships 
between individuals (elements of A x x A 1 ). 

A typical algorithm will start with a single individual satisfying D and 
try to construct a tableau, or some structure from which a tableau can 
be constructed, by inferring the existence of additional individuals or of 
additional constraints on individuals. The inference mechanism consists of 
applying a set of expansion rules which correspond to the logical constructs 
of the language, and the algorithm terminates either when the structure is 
complete (no further inferences are possible) or when obvious contradictions 
have been revealed. Non-determinism is dealt with by searching different 
possible expansions: the concept is unsatisfiable if every expansion leads 
to a contradiction and is satisfiable if any possible expansion leads to the 
discovery of a complete non-contradictory structure. 

Theoretical presentations of tableau algorithms use a variety of notational 
styles including constraints [Hollunder et al., 1990], prefixes [De Giacomo 
and Massacci, 1996] and labeled graphs [Sattler, 1996]. We will use the la- 
beled graph notation as it has an obvious correspondence with standard 
implementation techniques. In its basic form, this notation describes the 
construction of a directed graph (usually a tree) in which each node x is 
labeled with a set of concepts (£(x) = {Ci, . . . , C n }), and each edge (x, y ) 
is labeled with a role (C((x,y)) = R). When a concept C is in the label 
of a node x (C G C(x)), it represents a model in which the individual 
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n-rule 

if 1. 

(CnD)e C( x) 


2. 

{C,D}£C(,x) 


then 

C{x) — >C{x)U{C,D} 

U-rule 

if 1. 

(CuD)e C(x) 


2. 

(C, D} n C(x) = 0 


then 

either Cfx) — > C(x) U {G} 
or C(x) — * C(x) U {£>} 

3-rule 

if 1. 

3 R.C € C( x) 


2. 

there is no y s.t. C({x,y )) = R and C 6 C(y) 


then 

create a new node y and edge (x, y) 
with C(y) — {G} and C({x,y)) = R 

V-rule 

if 1. 

MR..C € C(x) 


2. 

there is some y s.t. C{{x,y)) = R and C £ C{y) 


then 

C(y) — - C(y) U {G} 

Fig. 9.1 

. Tableau expansion rules for ACC. 


corresponding with x is in the interpretation of C . When an edge (x, y) 
is labeled R (C((x,y)) = R), it represents a model in which the tuple cor- 
responding with (x, y) is in the interpretation of R. A node y is called an 
ill-successor of a node x if there is an edge (x, y) labeled R, x is called the 
predecessor of y if y is an /^-successor of x, and x is called an ancestor of 
y if x is the predecessor of y or there exists some node z such that z is 
the predecessor of y and x is an ancestor of z. A contradiction or clash is 
detected when {C, ~^C} C C(x) for some concept C and some node x. 

To test the satisfiability of a concept D. a basic algorithm initializes a 
tree to contain a single node x (called the root node) with C(x) = {D}, and 
then expands the tree by applying rules that either extend node labels or 
add new leaf nodes. A set of expansion rules for the ACC Description Logic 
is shown in Figure 9.1, where C and D are concepts, and R is a role. Note 
that: 


• Concepts are assumed to be in negation normal form, that is with negations only 

applying to concept names. Arbitrary ACC concepts can be converted to negation 
normal form by pushing negations inwards using a combination of De Morgan’s 
laws and the equivalences -i(3 R.C) <£=>■ (VI?. ->C) and ->(Vi?.C) -f=>- (31?. ->(7). 
This procedure can be extended to more expressive logics by using additional 
equivalences such as n R ) (n + 1 )R). 

• Disjunctive concepts (CUD) £ C(x) give rise to non-deterministic expansion. 
In practice this is usually dealt with by search: trying each possible expansion 
in turn until a fully expanded and clash-free tree is found, or all possibilities 
have been shown to lead to contradictions. In more expressive logics other 
constructs, such as maximum number restrictions (^nR), also lead to non- 
deterministic expansion. Searching non-deterministic expansions is the main 
cause of intractability in tableau subsumption testing algorithms. 
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• Existential role restriction concepts 3R.C G C(x) cause the creation of new 
R-successor nodes, and universal role restriction concepts VR.C G T(x) extend 
the labels of R-successor nodes. 

The tree is fully expanded when none of the expansion rules can be ap- 
plied. If a fully expanded and clash-free tree can be found, then the algorithm 
returns satisfiable ; otherwise it returns unsatisfiable. 

More expressive logics may require several extensions to this basic for- 
malism. For example, with logics that include both role inclusion axioms 
and some form of cardinality restriction, it may be necessary to label edges 
with sets of role names instead of a single role name [Horrocks, 1998b]. It 
may also be necessary to add cycle detection (often called blocking) to the 
preconditions of some of the inference rules in order to guarantee termina- 
tion [Buchheit et al, 1993a; Baader et al . , 1996], the general idea being to 
stop the expansion of a branch whenever the same node label recurs in the 
branch. Blocking can also lead to a more complex correspondence between 
the structure created by the algorithm and a model of a satisfiable concept, 
as the model may contain cycles or even be infinite [Horrocks and Sattler, 
1999]. 


9.4 Theory versus practice 

So far, what we have seen is typical of theoretical presentations of tableau- 
based decision procedures. Such a presentation is sufficient for soundness 
and completeness proofs, and is an essential starting point for the imple- 
mentation of a reliable subsumption-testing algorithm. However, there often 
remains a considerable gap between the theoretical algorithm and an actual 
implementation. Additional points which may need to be considered are: 

• the efficiency of the algorithm, in the theoretical (worst-case) sense; 

• the efficiency of the algorithm, in a practical (typical-case) sense; 

• how to use the algorithm for reasoning with unfoldable, general and cyclical KBs; 

• optimizing the (implementation of the) algorithm to improve the typical-case 
performance. 

In the remainder of this section we will consider the first three points, while 
in the following section we will consider implementation and optimization 
techniques in detail. 


9-4- 1 Worst-case complexity 

When considering an implementation, it is sensible to start with an algo- 
rithm that is known to be theoretically efficient, even if the implementation 
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3V-rule 


if 1. 3 R.C 6 C(x) 

2. there is no y s.t. C((x,y)) = R and C 6 C(y) 

3. neither the n-rule nor the U-rule is applicable to C(x) 
then create a new node y and edge ( x , y) 

with C(y) = {C} U {D \ VR.D E £(i)} and C((x,y)) = R 


Fig. 9.2. Combined 3V-rule for ACC. 


subsequently departs from the theory to some extent. Theoretically effi- 
cient is taken to mean that the complexity of the algorithm is equal to the 
complexity of the satisfiability problem for the logic, where this is known, 
or at least that consideration has been given to the worst-case complexity 
of the algorithm. This is not always the case, as the algorithm may have 
been designed to facilitate a soundness and completeness proof, with little 
consideration having been given to worst-case complexity, much less imple- 
mentation. 

Apart from establishing an upper bound for the “hardness” of the prob- 
lem, studies of theoretical complexity can suggest useful implementation 
techniques. For example, a study of the complexity of the satisfiability prob- 
lem for ACC concepts with respect to a general KB has demonstrated that 
caching of intermediate results is required in order to stay in ExpTime 
[Donini et al, 1996a], while studying the complexity of the satisfiability 
problem for STM concepts has shown that a more sophisticated labeling 
and blocking strategy can be used in order to stay in PSpace [Horrocks 
et al ., 1999]. 

One theoretically derived technique that is widely used in practice is the 
trace technique. This is a method for minimizing the amount of space used 
by the algorithm to store the tableau expansion tree. The idea is to impose 
an ordering on the application of expansion rules so that local propositional 
reasoning (finding a clash-free expansion of conjunctions and disjunctions 
using the n-rule and U-rule) is completed before new nodes are created using 
the 3-rule. A successor created by an application of the 3-rule, and any 
possible applications of the V-rule, can then be treated as an independent 
subproblem that returns either satisfiable or unsatisfiable, and the space 
used to solve it can be re-used in solving the next subproblem. A node 
x returns satisfiable if there is a clash-free propositional solution for which 
any and all subproblems return satisfiable ; otherwise it returns unsatisfiable. 
In algorithms where the trace technique can be used, the V-rule is often 
incorporated in the 3-rule, giving a single rule as shown in Figure 9.2. 

Apart from minimizing space usage, the trace technique is generally 
viewed as a sensible way of organizing the expansion and the flow of con- 
trol within the algorithm. Ordering the expansion in this way may also be 
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required by some blocking strategies [Buchheit et al, 1993a], although in 
some cases it is possible to use a more efficient subset blocking technique 
that is independent of the ordering [Baader et al, 1996]. 

The trace technique relies on the fact that node labels are not affected 
by the expansion of their successors. This is no longer true when the logic 
includes inverse roles, because universal value restrictions in the label of a 
successor of a node x can augment jC(x). This could invalidate the existing 
propositional solution for jC(x), or invalidate previously computed solutions 
to subproblems in other successor nodes. For example, if 

jC(x) = { 3R.C,3S.(VS~.(VR.^C ))}, 

then x is obviously unsatisfiable as expanding 3S.(\/S~ .(VR.^C)) will add 
yR.—'C to C(x), meaning that x must have an R - successor whose label con- 
tains both C and —<C. The contradiction would not be discovered if the 
ii-successor required by 3R.C £ C,{x) were generated first, found to be sat- 
isfiable and then deleted from the tree in order to save space. 

The development of a PSpace algorithm for the STM logic has shown 
that a modified version of the trace technique can still be used with logics 
that include inverse roles [Horrocks et al, 1999]. However, the modification 
requires that the propositional solution and all subproblems are re-computed 
whenever the label of a node is augmented by the expansion of a universal 
value restriction in the label of one of its successors. 


9.4-2 Typical-case complexity 

Although useful practical techniques can be derived from the study of the- 
oretical algorithms, it should be borne in mind that minimizing worst-case 
complexity may require the use of techniques that clearly would not be 
sensible in typical cases. This is because the kinds of pathological problem 
that would lead to worst-case behavior do not seem to occur in realistic ap- 
plications. In particular, the amount of space used by algorithms does not 
seem to be a practical problem, whereas the time taken for the computation 
certainly is. For example, in experiments with the FaCT system using the 
DL’98 test suite, available memory (200Mb) was never exhausted in spite of 
the fact that some single computations required hundreds of seconds of CPU 
time [Horrocks and Patel-Schneider, 1998b]. In other experiments using the 
Galen KB, computations were run for tens of thousands of seconds of CPU 
time without exhausting available memory. 

In view of these considerations, techniques that save space by recomput- 
ing are unlikely to be of practical value. The modified trace technique used 
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in the PSpace STM algorithm (see Subsection 9.4.1), for example, is prob- 
ably not of practical value. However, the more sophisticated labeling and 
blocking strategy, which allows the establishment of a polynomial bound on 
the length of branches, could be used not only in an implementation of the 
STM algorithm, but also in implementations of more expressive logics where 
other considerations mean that the PSpace result no longer holds [Horrocks 
et al, 1999]. 

In practice, the poor performance of tableau algorithms is due to non- 
determinism in the expansion rules (for example the U-rule), and the result- 
ing search of different possible expansions. This is often treated in a very 
cursory manner in theoretical presentations. For soundness and completeness 
it is enough to prove that the search will always find a solution if one exists, 
and that it will always terminate. For worst-case complexity, an upper bound 
on the size of the search space is all that is required. As this upper bound 
is invariably exponential with respect to the size of the problem, exploring 
the whole search space would inevitably lead to intractability for all but 
the smallest problems. When implementing an algorithm it is therefore vital 
to give much more careful consideration to non-deterministic expansion, in 
particular how to reduce the size of the search space and how to explore it in 
an efficient manner. Many of the optimizations discussed in subsequent sec- 
tions will be aimed at doing this, for example by using absorption to localize 
non- determinism in the KB, dependency directed backtracking to prune the 
search tree, heuristics to guide the search, and caching to avoid repetitive 
search. 


9.4-3 Reasoning with a knowledge base 

One area in which the theory and practice diverge significantly is that of 
reasoning with respect to the axioms in a KB. This problem is rarely consid- 
ered in detail: with less expressive logics the KB is usually restricted to being 
unfoldable, while with more expressive logics, all axioms can be treated as 
general axioms and dealt with via internalization. In either case it is suffi- 
cient to consider an algorithm that tests the satisfiability of a single concept, 
usually in negation normal form. 

In practice, it is much more efficient to retain the structure of the 
KB for as long as possible, and to take advantage of it during subsump- 
tion/satisfiability testing. One way in which this can be done is to use lazy 
unfolding - only unfolding concepts as required by the progress of the sub- 
sumption or satisfiability testing algorithm [Baader et al., 1992a]. With a 
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tableau algorithm, this means that a defined concept A is only unfolded when 
it occurs in a node label. For example, if T contains the non-primitive defi- 
nition axiom A = C, and the ll-rule is applied to a concept (AnD) 6 C(x) 
so that A and D are added to C(x), then at this point A can be unfolded by 
replacing it with C. 

Used in this way, lazy unfolding already has the advantage that it avoids 
unnecessary unfolding of irrelevant subconcepts, either because a contra- 
diction is discovered without fully expanding the tree, or because a non- 
deterministic expansion choice leads to a complete and clash-free tree. How- 
ever, a much greater increase in efficiency can be achieved if, instead of 
replacing concept names with their definitions, names are retained when 
their definitions are added. This is because the discovery of a clash between 
concept names can avoid expansion of their definitions [Baader et al, 1992a]. 

In general, lazy unfolding can be described as additional tableau expansion 
rules, defined as follows: 


[/i-rule 

if 1 . 

A g C{x) and (A = C) & T 


2 . 

c i c{x) 


then 

C(x) — > £{x) U {C} 

[/ 2 -rule 

if 1 . 

-■A G £(x) and (A = C)eT 


2 . 

- C i C{x) 


then 

C(x) — > £(x) U {—■<//} 

[/ 3 -rule 

if 1 . 

A G C(x) and (A C C) G T 


2 . 

c i c{x) 


then 

C(x) — > £(x) U {C}. 


The t/i-rule and L^-rule reflect the symmetry of the equality relation in the 
non-primitive definition A = C, which is equivalent toACC and ->A C —>C. 
The [/ 3 -rule on the other hand reflects the asymmetry of the subsumption 
relation in the primitive definition ACC*. 

Treating all the axioms in the KB as general axioms, and dealing with 
them via internalization, is also highly inefficient. For example, if T contains 
an axiom A CC, where A is a concept name not appearing on the left-hand 
side of any other axiom, then it is easy to deal with the axiom using the 
lazy unfolding technique, simply adding C to the label of any node in which 
A appears. Treating all axioms as general axioms would be equivalent to 
applying the following additional tableau expansion rules: 



346 


I. Horrocks 


Ji-rule 

if 1 . 

(■ C = D)eT 


2 . 

(D U -,C) i £{x) 


then 

£{x) - 

— > C{x) U {(D U -iC)} 

/ 2 -rule 

if 1 . 

(< C = D)eT 


2 . 

(-- D U C) 0 C(x) 


then 

£{x) - 

— t C{x) U {(-iD LI C)} 

/ 3 -rule 

if 1 . 

(C^D)eT 


2 . 

(D U -,C) i C{x) 


then 

C{x) - 

— t £(x) U {(D LI -iC)} 


With (A C C) £ T, this would result in the disjunction ( C U — >A) being 
added to the label of every node, leading to non-deterministic expansion 
and search, the main cause of empirical intractability. 

The solution to this problem is to divide the KB into two components, 
an unfoldable part T u and a general part T g . such that T g = T \ T U , and 
T u contains unique, acyclic, definition axioms. This is easily achieved, e.g., 
by initializing T a to 0 (which is obviously unfoldable), then for each axiom 
X in T, adding X to T a if T u U X is still unfoldable, and adding X to T g 
otherwise . 3 It is then possible to use lazy unfolding to deal with T u , and 
internalization to deal with T g . 

Given that the satisfiability-testing algorithm includes some sort of cycle 
checking, such as blocking, then it is even possible to be a little less conser- 
vative with respect to the definition of T u by allowing it to contain cyclic 
primitive definition axioms, for example axioms of the form A C 3R. A. Lazy 
unfolding will ensure that A 1 C 3R.A 1 by adding 3R.A to every node con- 
taining A, while blocking will take care of the non-termination problem that 
such an axiom would otherwise cause [Horrocks, 1997b]. Moreover, multiple 
primitive definitions for a single name can be added to T u , or equivalently 
merged into a single definition using the equivalence 

(A c Ci), . . . , (A c c n ) <=► AE^n-ng. 

However, if T u contains a non-primitive definition axiom A = C, then it 
cannot contain any other definitions for A, because this would be equivalent 
to allowing general axioms in T u . For example, given a general axiom CCD, 
this could be added to T u as A C D and A = C, where A is a new name not 
appearing in T . Moreover, certain kinds of non-primitive cycles cannot be 
allowed as they can be used to constrain possible models a way that would 
not be reflected by unfolding. For example, if (A = -iA) £ T for some concept 

3 Note that the result may depend on the order in which the axioms in T are processed. 
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name A, then the domain of all valid interpretations of T must be empty, 
and T j= C C D for all concepts C and D [Horrocks and Tobies, 2000]. 


9.5 Optimization techniques 

The Kris system demonstrated that by taking a well-designed tableau al- 
gorithm, and applying some reasonable implementation and optimization 
techniques (such as lazy expansion), it is possible to obtain a tableau-based 
DL system that behaves reasonably well in typical cases, and compares fa- 
vorably with systems based on structural algorithms [Baader et al, 1992a]. 
However, this kind of system is still much too slow to be usable in many 
realistic applications. Fortunately, it is possible to achieve dramatic improve- 
ments in typical-case performance by using a wider range of optimization 
techniques. 

As DL systems are often used to classify a KB, a hierarchy of optimization 
techniques is naturally suggested based on the stage of the classification 
process at which they can be applied. 

1. Pre-processing optimizations that try to modify the KB so that classification 
and subsumption testing are easier. 

2. Partial ordering optimizations that try to minimize the number of subsumption 
tests required in order to classify the KB. 

3. Subsumption optimizations that try to avoid performing a potentially expen- 
sive satisfiability test, usually by substituting a cheaper test. 

4. Satisfiability optimizations that try to improve the typical-case performance 
of the underlying satisfiability tester. 


9.5.1 Pre-processing optimizations 

The axioms that constitute a DL KB may have been generated by a hu- 
man knowledge engineer, as is typically the case in ontological engineering 
applications, or be the result of some automated mapping from another 
formalism, as is typically the case in DB schema and query reasoning appli- 
cations. In either case it is unlikely that a great deal of consideration was 
given to facilitating the subsequent reasoning procedures; the KB may, for 
example, contain considerable redundancy and may make unnecessary use 
of general axioms. As we have seen, general axioms are costly to reason with 
due to the high degree of non-determinism that they introduce. 

It is, therefore, useful to preprocess the KB, applying a range of syntactic 
simplifications and manipulations. The first of these, normalization, tries to 
simplify the KB by identifying syntactic equivalences, contradictions and 
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tautologies. The second, absorption, tries to eliminate general axioms by 
augmenting definition axioms. 

9. 5. 1.1 Normalization 

In realistic KBs, at least those manually constructed, large and complex 
concepts are seldom described monolithically, but are built up from a hi- 
erarchy of named concepts whose descriptions are less complex. The lazy 
unfolding technique described above can use this structure to provide more 
rapid detection of contradictions. 

The effectiveness of lazy unfolding is greatly increased if a contradiction 
between two concepts can be detected whenever one is syntactically equiv- 
alent to the negation of the other; for example, we would like to discover a 
direct contradiction between (C n D) and {—'D U —>C). This can be achieved 
by transforming all concepts into a syntactic normal form, and by directly 
detecting contradictions caused by non-atomic concepts as well as those 
caused by concept names. 

In Description Logics there is often redundancy in the set of concept- 
forming operators. In particular, logics with full negation often provide 
pairs of operators, either one of which can be eliminated in favor of the 
other by using negation. Conjunction and disjunction operators are an 
example of such a pair, and one can be eliminated in favor of the other using 
De Morgan’s laws. In syntactic normal form, all concepts are transformed 
so that only one of each such pair appears in the KB (it does not matter 
which of the two is chosen, the important thing is uniformity). In ACC, 
for example, all concepts could be transformed into (possibly negated) 
value restrictions, conjunctions and atomic concept names, with (->Z) U —> C) 
being transformed into —>(D\lC). An important refinement is to treat 
conjunctions as sets (written n{Ci, . . . , C n }) so that reordering or repeating 
the conjuncts does not affect equivalence; for example, (D n C) would 
be normalized as n {C,D}. 4 Normalization can also include a range of 
simplifications so that syntactically obvious contradictions and tautologies 
are detected; for example, 3R.A. could be simplified to _L. 

Figure 9.3 describes normalization and simplification functions Norm and 
Simp for ACC. These can be extended to deal with more expressive logics 
by adding appropriate normalizations (and possibly additional simplifica- 
tions). For example, number restrictions can be dealt with by adding the 
normalizations Norm(^ nit!) = (n+ l)i? and Norm(^ nR ) = ^ nR, and 

the simplification Simp(^ OR) = T. 

4 Sorting the elements in conjuctions, and eliminating duplicates, achieves the same result. 
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Norm(A) 
NormNC) 
Norm(Ci n . . . n C n ) 
Norm(Ci U . . . U C n ) 
Norm(V-R.C') 
Norm(3_R.C') 


A for atomic concept name A 
Simp(-n Norm(C)) 

Simp(n{Norm(Ci)} U • • • U {Norm(C„)}) 
Norm(-’(^C'i n • ■ • n -iC„)) 

Simp(Vf?. Norm(C)) 

Norm(^Vf?.^C) 


Simp(A) 


Simp^C) 


Simp(nS) 


Simp(V-R.C) 


A for atomic concept name A 
( X if C = T 

It if c — l 

\ Simp(D) if C = -iD 

I —iC otherwise 


_L 

if j_ e s 


if {C, C } C s 

T 

if S = 0 

Simp(S\{T}) 

if Te s 

Simp(nP u S \ {n{P}}) 

if n{P} s s 

ns 

otherwise 

T if C = T 

MR.C otherwise 



Fig. 9.3. Normalization and simplification functions for ACC. 


Normalized and simplified concepts may not be in negation normal form, 
but they can be dealt with by treating them exactly like their non-negated 
counterparts. For example, — ■n{C f , D} can be treated as (-■ C U ->D) and 
—<VR.C can be treated as 3R.—>C. In the remainder of this chapter we will 
use both forms interchangeably, choosing whichever is most convenient. 

Additional simplifications would clearly be possible. For example, Vi ?. C n 
\/R.D could be simplified to Vi?. Norm(C' n D). Which simplifications it is 
sensible to perform is an implementation decision that may depend on a 
cost-benefit analysis with respect to some particular application. Empir- 
ically, simplification seems to be more effective with mechanically gener- 
ated KBs and satisfiability problems, in particular those where the num- 
ber of different roles is very small. With this kind of problem it is quite 
common for satisfiability tests to be greatly simplified, or even completely 
avoided, by simplifying part or all of the concept to either T or _L. 
In the benchmark tests used for the Tableaux’98 comparison of modal 
logic theorem provers, for example, some classes of problems can be com- 
pletely solved via this mechanism [Heuerding and Schwendimann, 1996; 
Balsiger and Heuerding, 1998]. 

If the subsumption-testing algorithm is to derive maximum benefit from 
normalization, it is important that it directly detects contradictions caused 
by non-atomic concepts as well as those caused by concept names; for ex- 
ample the occurrence of both n{C, D} and -injC, D} in a node label should 
be detected as a contradiction without the need for further expansion. This 
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can be achieved by replacing all equivalent (identically encoded) non-atomic 
concepts C in the KB with a new atomic concept name A, and adding the 
axiom A = C to the KB. For example, all occurrences of n{C, D} in a KB 
could be replaced with CD, and the axiom CD = n{C, D} added to the KB. 

It is necessary to distinguish these newly introduced system names from 
user names appearing in the original KB, as system names need not be 
classified (indeed, it would be very confusing for the user if they were). In 
practice, it is often more convenient to avoid this problem by using pointer 
or object identifiers to refer to concepts, with the same identifier always 
being associated with equivalent concepts. A contradiction is then detected 
whenever a pointer/identifier and its negation occur in a node label. 

The advantages of the normalization and simplification procedure are: 

• It is easy to implement and could be used with most logics and algorithms. 

• Subsumption/satisfiability problems can often be simplified, and sometimes even 
completely avoided, by detecting syntactically obvious satisfiability and unsatis- 
fiability. 

• It complements lazy unfolding and improves early clash detection. 

• The elimination of redundancies and the sharing of syntactically equivalent struc- 
tures may lead to the KB being more compactly stored. 

The disadvantages are: 

• The overhead involved in the procedure, although this is relatively small. 

• For very unstructured KBs there may be no benefit, and it might even slightly 
increase the size of the KB. 


9. 5. 1.2 Absorption 

As we have seen in Subsection 9.4.3, general axioms are costly to reason 
with due to the high degree of non-determinism that they introduce. With 
a tableau algorithm, a disjunction is added to the label of each node for 
each general axiom in the KB. This leads to an exponential increase in the 
search space as the number of nodes and axioms increases. For example, 
with 10 nodes and a KB containing 10 general axioms there are already 100 
disjunctions, and they can be non-deterministically expanded in 2 100 differ- 
ent ways. For a KB containing large numbers of general axioms (there are 
1,214 in the Galen medical terminology KB) this can degrade performance 
to the extent that subsumption testing is effectively non-terminating. 

It therefore makes sense to eliminate general axioms from the KB when- 
ever possible. Absorption is a technique that tries to do this by absorbing 
them into primitive definition axioms. The basic idea is that a general axiom 
of the form CCD, where C may be a non-atomic concept, is manipulated 
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Ci n C 2 E D Ci c D u ->C 2 

C C D x n D 2 C C Di and C C D 2 

Fig. 9.4. Axiom equivalences used in absorption. 


(using the equivalences in Figure 9.4) so that it has the form of a primitive 
definition A C D', where A is an atomic concept name. This axiom can then 
be merged into an existing primitive definition A C C' to give ACC'nD'. 
For example, an axiom stating that all three-sided geometric figures (i.e. , 
triangles) also have three angles 

geometric-figure n Bangles. three C Bsides. three 

could be transformed into an axiom stating that all geometric figures either 
have three sides or do not have three angles 


geometric-figure C Bsides. three U -dangles. three 


and then absorbed into the primitive definition of geometric figure 
(geometric-figure C figure) to give 

geometric-figure C figure n (Bsides. three U -dangles. three). 


Given a KB divided into an unfoldable part T u and a general part T in the 
following procedure can be used to try to absorb the axioms from T f) into 
primitive definitions in T u . First a set 7)) is initialized to be empty, and any 
axioms (C = D) E T g are replaced with an equivalent pair of axioms C IT. D 
and -i C C —>D. Then for each axiom (C C D) £ 7),: 

1. Initialize a set G = {^D 1 C} 1 representing the axiom in the form TC 
-n{-iD, C} (i.e., TCDU ->C). 

2. If for some A G G there is a primitive definition axiom (A C C) € 7„, then 
absorb the general axiom into the primitive definition axiom so that it becomes 

A E n{C,^n(G \ {A})}, 

and exit. 

3. If for some A G G there is an axiom (A = D) G T u , then replace A G G with 
D 


G — > {D} UG \ {A}, 

and return to step 2. 

4. If for some ^A G G there is an axiom (A = D) G T u , then replace G G 
with — 1) 

G — ► {~< D } UG \ { _, A}, 


and return to step 2. 
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5. If there is some C £ G such that C is of the form nS, then use associativity 
to simplify G 

G — *SUG\{nS}, 

and return to step 2. 

6. If there is some C £ G such that C is of the form ->nS, then for every D £ S 
try to absorb (recursively) 

{^D} U G \ {HIS}, 

and exit. 

7. Otherwise, the axiom could not be absorbed, so add ->nG to T g 

T a — > T a U -HG, 

and exit. 

Note that this procedure allows parts of axioms to be absorbed. For example, 
given axioms (A C D i) £ T u and (A U 3R.C C D2) £ T g , then the general 
axiom would be partly absorbed into the definition axiom to give (A C 
(D\ n D2)) £ T u , leaving a smaller general axiom (— >n{— 'Zi>2 , 372.(7}) G T g . 

When this procedure has been applied to all the axioms in T g , then T g 
represents those (parts of) axioms that could not be absorbed. The axioms 
in Tg are already in the form T C C, so that T\T g is the concept that must 
be added to every node in the tableau expansion. This can be done using a 
universal role, as described in Subsection 9.2.4, although in practice it may 
be simpler just to add the concept to the label of each newly created node. 

The absorption process is clearly non-deterministic. In the first place, 
there may be more than one way to divide T into unfoldable and gen- 
eral parts. For example, if T contains multiple non-primitive definitions for 
some concept A, then one of them must be selected as a definition in T u 
while the rest are treated as general axioms in T g . Moreover, the absorp- 
tion procedure itself is non-deterministic as G may contain more than one 
primitive concept name into which the axiom could be absorbed. For ex- 
ample, in the case where {Ai, A2} = G, and there are two primitive def- 
inition axioms Aj C C and A2 C D in T u , then the axiom could be ab- 
sorbed either into the definition of Ai to give Ai C C I~1 -■n{A 2 } (equiva- 
lent to Ai C C n -1A2) or into the definition of /\o to give A2 C C n -in{Ai} 
(equivalent to A2 C C n -1A1). 

It would obviously be sensible to choose the “best” absorption (the one 
that maximized empirical tractability), but it is not clear how to do this - 
in fact it is not even clear how to define “best” in this context [Horrocks 
and Tobies, 2000]. If T contains more than one definition axiom for a given 
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concept name, then empirical evidence suggests that efficiency is improved 
by retaining as many non-primitive definition axioms in T u as possible. An- 
other intuitively obvious possibility is to preferentially absorb into the defini- 
tion axiom of the most specific primitive concept, although this only helps in 
the case that Ai C /\o or A 2 C Aj. Other more sophisticated schemes might 
be possible, but have yet to be investigated. 

The advantages of absorption are: 

• It can lead to a dramatic improvement in performance. For example, without 
absorption, satisfiability of the Galen KB (i.e., the satisfiability of T) could not 
be proved by either FaCT or Dlp, even after several weeks of CPU time. After 
absorption, the problem becomes so trivial that the CPU time required is hard 
to measure. 

• It is logic and algorithm independent. 

The disadvantage is the overhead required for the pre-processing, although 
this is generally small compared to classification times. However, the proce- 
dure described is almost certainly suboptimal, and trying to find an optimal 
absorption may be much more costly. 


9.5.2 Optimizing classification 

DL systems are often used to classify a KB, that is to compute a partial 
ordering or hierarchy of named concepts in the KB based on the subsump- 
tion relationship. As subsumption testing is always potentially costly, it is 
important to ensure that the classification process uses the smallest possible 
number of tests. Minimizing the number of subsumption tests required to 
classify a concept in the concept hierarchy can be treated as an abstract 
order-theoretic problem which is independent of the ordering relation. How- 
ever, some additional optimization can be achieved by using the structure 
of concepts to reveal obvious subsumption relationships and to control the 
order in which concepts are added to the hierarchy (where this is possible). 

The concept hierarchy is usually represented by a directed acyclic graph 
where nodes are labeled with sets of concept names (because multiple con- 
cept names may be logically equivalent), and edges correspond with sub- 
sumption relationships. The subsumption relation is both transitive and 
reflexive, so a classified concept A subsumes a classified concept B if either: 

1. both A and B are in the label of some node x, or 

2. A is in the label of some node x, there is an edge ( x,y } in the graph, and the 
concept (s) in the label of node y subsume B. 
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It will be assumed that the hierarchy always contains a top node (a node 
whose label includes T) and a bottom node (a node whose label includes _L) 
such that the top node subsumes the bottom node. If the KB is unsatisfiable 
then the hierarchy will consist of a single node whose label includes both T 
and _L. 

Algorithms based on traversal of the concept hierarchy can be used to min- 
imize the number of tests required in order to add a new concept [Baader 
et al, 1992a]. The idea is to compute a concept’s subsumers by searching 
down the hierarchy from the top node (the top search phase) and its sub- 
surnees by searching up the hierarchy from the bottom node (the bottom 
search phase). 

When classifying a concept A, the top search takes advantage of the tran- 
sitivity of the subsumption relation by propagating failed results down the 
hierarchy. It concludes, without performing a subsumption test, that if A is 
not subsumed by B, then it cannot be subsumed by any other concept that 
is subsumed by B: 

T y= A C B and T |= B' C B implies T y= A C B'. 

To maximize the effect of this strategy, a modified breadth-first search is 
used [Ellis, 1992] which ensures that a test to discover if B subsumes A is 
never performed until it has been established that A is subsumed by all of 
the concepts known to subsume B. 

The bottom search uses a corresponding technique, testing if A subsumes 
B only when A is already known to subsume all those concepts that are 
subsumed by B. Information from the top search is also used by confining the 
bottom search to those concepts which are subsumed by all of A’s subsumers. 

This abstract partial ordering technique can be enhanced by taking ad- 
vantage of the structure of concepts and the axioms in the KB. If the KB 
contains an axiom ACC 1 or A = C, then C is said to be a told subsumer 
of A. If C is a conjunctive concept (Ci n • • • n C n ), then from the structural 
subsumption relationship 

D C (Ci n • • • n C n ) implies d C C) and . . . and D C C n 

it is possible to conclude that C\, . . . ,C n are also told subsumers of A. More- 
over, due to the transitivity of the subsumption relation, any told subsumers 
of Ci, . . . , C n are also told subsumers of A. Before classifying A, all of its 
told subsumers which have already been classified, and all their subsumers, 
can be marked as subsumers of A; subsumption tests with respect to these 
concepts are therefore rendered unnecessary. This idea can be extended in 
the obvious way to take advantage of a structural subsumption relationship 
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with respect to disjunctive concepts, 

(Ci U • • • LI C n ) E D implies Ci E D and . . . and C n E D. 

If the KB contains an axiom A = C and C is a disjunctive concept (Ci U 

• • • U C n ), then A is a told subsumer of Ci, . . . , C n . 

To maximize the effect of the told subsumer optimization, concepts should 
be classified in definition order. This means that a concept A is not classified 
until all of its told subsumers have been classified. When classifying an 
unfoldable KB, this ordering can be exploited by omitting the bottom search 
phase for primitive concept names and assuming that they only subsume 
(concepts equivalent to) _L. This is possible because, with an unfoldable 
KB, a primitive concept can only subsume concepts for which it is a told 
subsumer. Therefore, as concepts are classified in definition order, a primitive 
concept will always be classified before any of the concepts that it subsumes. 
This additional optimization cannot be used with a general KB because, 
in the presence of general axioms, it can no longer be guaranteed that a 
primitive concept will only subsume concepts for which it is a told subsumer. 
For example, given a KB T such that 

T = { A: 3R.C , 3R.C E B}, 

then B is not a told subsumer of A, and A may be classified first. However, 
when B is classified the bottom search phase will discover that it subsumes 
A due to the axiom 3R.C E B. 

The advantages of the enhanced traversal classification method are: 

• It can significantly reduce the number of subsumption tests required in order to 
classify a KB [Baader et al . , 1992a]. 

• It is logic and (subsumption) algorithm independent. 

There appear to be few disadvantages to this method, and it is used (in 
some form) in most implemented DL systems. 


9.5.3 Optimizing subsumption testing 

The classification optimizations described in Subsection 9.5.2 help to reduce 
the number of subsumption tests that are performed when classifying a KB, 
and the combination of normalization, simplification and lazy unfolding fa- 
cilitates the detection of “obvious” subsumption relationships by allowing 
unsatisfiability to be rapidly demonstrated. However, detecting “obvious” 
non-subsumption (satisfiability) is more difficult for tableau algorithms. This 
is unfortunate as concept hierarchies from realistic applications are typically 
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{A, -iB, C, D, 3R 1 .C 1 ,3R 2 .C 2 ,3R 3 .C 3 } 

{Ci} {C 3 } {C 2 } 

Fig. 9.5. Joining expansion trees for A and ^B. 

broad, shallow and tree-like. The top search phase of classifying a new con- 
cept A in such a hierarchy will therefore result in several subsumption tests 
being performed at each node, most of which are likely to fail. These failed 
tests could be very costly (if, for example, proving the satisfiability of A is 
a hard problem), and they could also be very repetitive. 

This problem can be tackled by trying to use cached results from pre- 
vious tableau tests to prove non-subsumption without performing a new 
satisfiability test. For example, given two concepts A and B defined by the 
axioms 


A = Cn3R 1 .Cin3R 2 .C 2 , and 
B = — iZJ U V-R 3 .— 'C 3 , 

then A is not subsumed by B if the concept A n ->B is satisfiable. If tableau 
expansion trees for A and ->B have already been cached, then the satisfiability 
of the conjunction can be demonstrated by a tree consisting of the trees for 
A and ^B joined at their root nodes, as shown in Figure 9.5 (note that 
- 'B = D n 3 R 3 .C 3 ) . 

Given two fully expanded and clash- free tableau expansion trees Ti and 
T 2 representing models of (satisfiable) concepts A and ->B respectively, the 
tree created by joining Ti and T 2 at their root nodes is a fully expanded and 
clash- free tree representing a model of A n -iB provided that the union of the 
root node labels does not contain a clash and that no tableau expansion rules 
are applicable to the new tree. For most logics, this can be ascertained by 
examining the labels of the root nodes and the labels of the edges connecting 
them with their successors. With the ACC logic for example, if x\ and x 2 are 
the two root nodes, then the new tree will be fully expanded and clash-free 
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provided that: 

1. the union of the root node labels does not contain an immediate contradiction, 
i.e., there is no C such that {C, -iC} C C(x\) U£(a; 2 ), and 

2. there is no interaction between value restrictions in the label of one root node 
and edges connecting the other root node with its successors that might make 
the V-rule applicable to the joined tree, i.e., there is no R such that \/R.C € 
C{x i) and T 2 has an edge (x 2 ,y) with C({x 2 ,y)) = R, or \/R.C € C(x 2 ) and 
Ti has an edge ( x±,y } with C((x±,y)) = R. 

With more expressive logics it may be necessary to consider other interac- 
tions that could lead to the application of tableau expansion rules. With a 
logic that included number restrictions, for example, it would be necessary 
to check that these could not be violated by the root node successors in the 
joined tree. 

It would be possible to join trees in a wider range of cases by examining the 
potential interactions in more detail. For example, a value restriction VR.C £ 
C(x i) and an R labeled edge (x 2 , y) would not make the V-rule applicable to 
the joined tree if C £ C{x 2 )- However, considering only root nodes and edges 
provides a relatively fast test and reduces the storage required by the cache. 
Both the time required by the test and the size of the cache can be reduced 
even further by only storing relevant components of the root node labels 
and edges from the fully expanded and clash-free tree that demonstrates 
the satisfiability of a concept. In the case of AjCC, the relevant components 
from a tree demonstrating the satisfiability of a concept A are the set of 
(possibly negated) atomic concept names in the root node label (denoted 
C c ( A)), the set of role names in value restrictions in the root node label 
(denoted Cy( A)), and the set of role names labeling edges connecting the 
root node with its successors (denoted £3 (A)). 5 These components can be 
cached as a triple (£ C (A), C\/(A), / 13 (A)). 

When testing if A is subsumed by B, the algorithm can now proceed as 
follows: 

1. If any of (£ C (A), £ V (A), £ 3 (A)), (£ C (^A),£ V (-A),£ 3 (-A)), (£ C (B),£ V (B), 
£ 3 (B)) or (£ C (^B), £y(^B), £ 3 (-iB)) are not in the cache, then perform the 
appropriate satisfiability tests and update the cache accordingly. In the case 
where a concept C is unsatisfiable, £ c (C) = { J_} and C c (->C ) = {T }. 

2. Conclude that AC B (An ^B is not satisfiable) if £ C (A) = { _L} or £ C (B) = 

{T}. 

3. Conclude that A g B (A n ->B is satisfiable) if 

5 Consideration can be limited to atomic concept names because expanded conjunction and 

disjunction concepts are no longer relevant to the validity of the tree, and are only retained in 

order to facilitate early clash detection. 
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(a) £ C (A) = {T} and £ c ( B) £ {T}, or 

(b) £ C (A) ^ {J_} and £ C (B) = {_L}, or 

(c) £v(A) n £ a (B) = 0, £y(B) n£ a (A) = 0, _L ^ £ C (A) U£ C (B), and there is 
no C such that { C , ->C} C £ C (A) U £ C (B). 

4. Otherwise perform a satisfiability test on A n ^B, concluding that A C B if it 
is not satisfiable and that A % B if it is satisfiable. 

When a concept A is added to the hierarchy, this procedure will result 
in satisfiability tests immediately being performed for both A and ->A. Dur- 
ing the subsequent top search phase, at each node x in the hierarchy such 
that some C £ C(x) subsumes A, it will be necessary to perform a subsump- 
tion test for each subsumee node y (unless some of them can be avoided 
by the classification optimizations discussed in Subsection 9.5.2). Typically 
only one of these subsumption tests will lead to a full satisfiability test be- 
ing performed, the rest being shown to be obvious non-subsumptions using 
the cached partial trees. Moreover, the satisfiability test that is performed 
will often be an “obvious” subsumption, and unsatisfiability will rapidly be 
demonstrated. 

The optimization is less useful during the bottom search phase as nodes in 
the concept hierarchy are typically connected to only one subsuming node. 
The exception to this is the bottom (_L) node, which may be connected to 
a very large number of subsuming nodes. Again, most of the subsumption 
tests that would be required by these nodes can be avoided by demonstrating 
non-subsumption using cached partial trees. 

The caching technique can be extended in order to avoid the construction 
of obviously satisfiable and unsatisfiable subtrees during tableau expansion. 
For example, if some leaf node x is about to be expanded, and C(x) = 
{A}, unfolding and expanding C(x) is clearly unnecessary if A is already 
known to be either satisfiable (i.e. , (£ C (A), £y( A), £ 3 (A)) is in the cache and 
C c { A) / {T}) or unsatisfiable (i.e., (£ C (A), £v(A), £a(A)) is in the cache and 
£c(A) = {T}). 

This idea can be further extended by caching (when required) partial trees 
for all the syntactically distinct concepts discovered by the normalization 
and simplification process, and trying to join cached partial trees for all the 
concepts in a leaf node’s label before starting the expansion process. For 
example, with the logic ACC and a node x such that 

C[x) = {C u ...,C n }, 

x is unsatisfiable if for some 1 ^ i ^ n, C c (Ci) = {T}, and x is satisfiable if 
for all 1 < i < n and i < j < n: 
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1. £ v (a)n£ 3 ((7 j ) = 0, 

2. C 3 {Ci) n £ v (Cj) = 0, and 

3. there is no C such that { C , ->C} C £ c (Cj) U C c (Cj). 

As before, additional interactions may need to be considered with more 
expressive logics. Moreover, with logics that support inverse roles, the effect 
that the subtree might have on its predecessor must also be considered. 
For example, if x is an i?-successor of some node y, and R~ G Cy(Ci) for 
one of the Ci G C(x), then the expanded C(x) represented by the cached 
partial trees would contain a value restriction of the form Vi? - .D that could 
augment C(y). 

The advantages of caching partial tableau expansion trees are: 

• When classifying a realistic KB, most satisfiability tests can be avoided. For 
example, the number of satisfiability tests performed by the FaCT system when 
classifying the Galen KB is reduced from 122,695 to 23,492, a factor of over 
80%. 

• Without caching, some of the most costly satisfiability tests are repeated (with 
minor variations) many times. The time saving due to caching is therefore even 
greater than the saving in satisfiability tests. 

The disadvantages are: 

• The overhead of performing satisfiability tests on individual concepts and their 
negations in order to generate the partial trees that are cached. 

• The overhead of storing the partial trees. This is not too serious a problem as the 
number of trees cached is equal to the number of named concepts in the KB (or 
the number of syntactically distinct concepts if caching is used in subproblems). 

• The overhead of determining if the cached partial trees can be merged, which is 
wasted if they cannot be. 

• Its main use is when classifying a KB, or otherwise performing many similar 
satisfiability tests. It is of limited value when performing single tests. 


9.5.4 Optimizing satisfiability testing 

In spite of the various techniques outlined in the preceding sections, at some 
point the DL system will be forced to perform a “real” subsumption test, 
which for a tableau-based system means testing the satisfiability of a con- 
cept. For expressive logics, such tests can be very costly. However, a range 
of optimizations can be applied that dramatically improve performance in 
typical cases. Most of these are aimed at reducing the size of the search 
space explored by the algorithm as a result of applying non-deterministic 
tableau expansion rules. 
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Fig. 9.6. Syntactic branching search. 

9.5-4- 1 Semantic branching search 

Standard tableau algorithms use a search technique based on syntactic 
branching. When expanding the label of a node x, syntactic branching works 
by choosing an unexpanded disjunction (C\ U • • • U C n ) in C(x) and search- 
ing the different models obtained by adding each of the disjuncts C\. . . . , 
C n to C(x) [Giunchiglia and Sebastiani, 1996b]. As the alternative branches 
of the search tree are not disjoint, there is nothing to prevent the recur- 
rence of an unsatisfiable disjunct in different branches. The resulting wasted 
expansion could be costly if discovering the unsatisfiability requires the so- 
lution of a complex subproblem. For example, tableau expansion of a node 
x, where {(A U B), (A LI C)} C C(x) and A is an unsatisfiable concept, could 
lead to the search pattern shown in Figure 9.6, in which the unsatisfiability 
of C(x) U A must be demonstrated twice. 

This problem can be dealt with by using a semantic branching technique 
adapted from the Davis-Putnam-Logemann-Loveland procedure (DPLL) 
commonly used to solve propositional satisfiability (SAT) problems [Davis 
and Putnam, 1960; Davis et ah, 1962; Freeman, 1996]. 6 Instead of choosing 
an unexpanded disjunction in C(x), a single disjunct D is chosen from one of 
the unexpanded disjunctions in C(x). The two possible subtrees obtained by 
adding either D or -> D to C(x) are then searched. Because the two subtrees 
are strictly disjoint, there is no possibility of wasted search as in syntactic 
branching. Note that the order in which the two branches are explored is 
irrelevant from a theoretical viewpoint, but may offer further optimization 
possibilities (see Subsubsection 9. 5. 4. 4). 

The advantages of semantic branching search are: 

• A great deal is known about the implementation and optimization of the DPLL 
algorithm. In particular, both local simplification (see Subsubsection 9. 5. 4. 2) and 
heuristic guided search (see Subsubsection 9. 5. 4.4) can be used to try to minimize 
the size of the search tree (although it should be noted that both these techniques 
can also be adapted for use with syntactic branching search). 


An alternative solution is to enhance syntactic branching with “no-good” lists in order to avoid 
reselecting a known unsatisfiable disjunct [Donini and Massacci, 2000]. 
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• It can be highly effective with some problems, particularly randomly generated 
problems [Horrocks and Patel-Schneider, 1999]. 

The disadvantages are: 

• It is possible that performance could be degraded by adding the negated disjunct 
in the second branch of the search tree, for example if the disjunct is a very large or 
complex concept. However this does not seem to be a serious problem in practice, 
with semantic branching rarely exhibiting significantly worse performance than 
syntactic branching. 

• Its effectiveness is problem dependent. It is most effective with randomly gen- 
erated problems, particularly those that are over-constrained (likely to be un- 
satisfiable) . It is also effective with some of the hand crafted problems from the 
Tableaux’98 benchmark suite. However, it appears to be of little benefit when 
classifying realistic KBs [Horrocks and Patel-Schneider, 1998a] . 

9. 5. 4- 2 Local simplification 

Local simplification is another technique used to reduce the size of the search 
space resulting from the application of non-deterministic expansion rules. 
Before any non-deterministic expansion of a node label C(x) is performed, 
disjunctions in C(x) are examined, and if possible simplified. The simplifi- 
cation most commonly used (although by no means the only one possible) 
is to deterministically expand disjunctions in C{x) that present only one 
expansion possibility and to detect a clash when a disjunction in C(x ) 
has no expansion possibilities. This simplification has been called Boolean 
constraint propagation (BCP) [Freeman, 1995]. In effect, the inference rules 

-'Ci, . . . , -iCVi, Ci u • • • u C n u D Ci, . . . , C n , -i Ci u • • • u -i C n u D 

„ and — 

D D 

are being used to simplify the conjunctive concept represented by C(x). 

For example, given a node x such that 

{(C U (Hi n Z? 2 )), (-i-Di LI -iZ ?2 U C ), _1 C'} C C{x), 

BCP deterministically expands the disjunction (C U (Pj n P2)), adding 
(Pi n D2) to C{x), because -1 C £ £(1). The deterministic expansion of 
(Pi n P2) adds both D 1 and D 2 to C(x), allowing BCP to identify 
(—‘Di U — 1 H 2 U C) as a clash (without any branching having occurred), be- 
cause {Di, D- 2 ,^C} C C{x). 

BCP simplification is usually described as an integral part of SAT-based 
algorithms [Giunchiglia and Sebastiani, 1996a], but it can also be used with 
syntactic branching. However, it is more effective with semantic branching as 
the negated concepts introduced by failed branches can result in additional 
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C(X) = {(AUB),(AUC)} 

u y \u 

C(x) U {A} => clash Cx) ^ x ) U B, C} 

Fig. 9.7. Semantic branching search. 

simplifications. Taking the above example of {(iUfi),(iUC)} C£(x), 
adding —>A to C(x) allows BCP to deterministically expand both of the dis- 
junctions using the simplifications {A U B) A — i A =A B and (A U C) A —> A =A 
C. The reduced search space resulting from the combination of semantic 
branching and BCP is shown in Figure 9.7. 

The advantages of local simplification are: 

• It is applicable to a wide range of logics and algorithms. 

• It can never increase the size of the search space. 

The disadvantages are: 

• It may be costly to perform without using complex data structures [Freeman, 
1995], 

• Its effectiveness is relatively limited and problem dependent. It is most effective 
with randomly generated problems, particularly those that are over-constrained 
[Horrocks and Patel-Schneider, 1998a]. 

9. 5. 4- 3 Dependency directed backtracking 
Inherent unsatisfiability concealed in subproblems can lead to large amounts 
of unproductive backtracking search, sometimes called thrashing. The prob- 
lem is exacerbated when blocking is used to guarantee termination, because 
blocking may require that subproblems only be explored after all other forms 
of expansion have been performed. For example, expanding a node x (using 
semantic branching), where 

C{x) = {(<?! U Di), ...,(C n U D n ), 3R.(A n B),\/R.-iA}, 

could lead to the fruitless exploration of 2 n possible ii-successors of x before 
the inherent unsatisfiability is discovered (note that if C(x) simply included 
3R.A instead of 3R.(A n B), then the inherent unsatisfiability would have 
been detected immediately due to the normalization of 3R.A as -iV-R.-i-A). 
The search tree resulting from the tableau expansion is illustrated in Fig- 
ure 9.8. 

This problem can be addressed by identifying the causes of clashes, and 
using this information to prune or restructure the search space - a technique 
known as dependency directed backtracking. The form most commonly used 
in practice, called backjumping, is adapted from a technique that has been 
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used in solving constraint satisfiability problems [Baker, 1995] (a similar 
technique was also used in the Harp theorem prover [Oppacher and Suen, 
1988]). Backjumping works by labeling each concept in a node label and 
each role in an edge label with a dependency set indicating the branching 
points on which it depends. A concept C G C(x) depends on a branching 
point if C was added to C(x) at the branching point or if C depends on 
another concept D (or role R), and D (or R) depends on the branching 
point. A concept C G C(x) depends on a concept D (or role R) when C 
was added to C(x) by the application of a deterministic expansion rule that 
used D (or I?); a role R = C({x , y)) depends on a concept D when (x, y) was 
labeled R by the application of a deterministic expansion rule that used D. 
For example, if A € C(y) was derived from the expansion of Vi?. A G C(x), 
then A G C(y) depends on both MR. A G C(x) and R = C((x, y}). 

Labeling roles with dependency sets can be avoided in algorithms where 
a combined 3V-rule is used, as the dependency sets for concepts in the label 
of the new node can be derived in a single step. On the other hand, more 
complex algorithms and optimization techniques may lead to more complex 
dependencies. For example, if C n G C(x) was derived from a BCP simplifi- 
cation of {(Ci U • • • U C n ), -iCi, . . . , -iC n _i} C jC(x), then it depends on the 
disjunction (Ci U • • • LI C n ) and all of -iCi, . . . , -iC n _i. 

When a clash is discovered, the dependency sets of the clashing concepts 
can be used to identify the most recent branching point where exploring the 
other branch might alleviate the cause of the clash. It is then possible to jump 
back over intervening branching points without exploring any alternative 
branches. Again, more complex algorithms and optimizations may lead to 
more complex dependencies. For example, if the clash results from a BCP 
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simplification of {(Cj U • • • U C n ), ~>C \ . . . . , -iC n } C C(x), then it depends on 
the disjunction (C\ LI • • • U C n ) and all of ~>C\ , . . . , -i(7 n . 

When testing the satisfiability of a concept C, the dependency set of 
C G £(x') is initialized to 0 (the empty set) and a branching depth counter 
b is initialized to 1. The search algorithm then proceeds as follows: 

1. Perform deterministic expansion, setting the dependency set of each concept 

added to a node label and each role assigned to an edge label to the union of 

the dependency sets of the concepts and roles on which they depend. 

(a) If a clash is discovered, then return the union of the dependency sets of the 
clashing concepts. 

(b) If a clash-free expansion is discovered, then return {0}. 

2. Branch on a concept D G C(y), trying first C{y) U {D} and then C(y) U 

(a) Add D to C(y) with a dependency set {fo}, and increment b. 

(b) Set Di to the dependency set returned by a recursive call to the search 
algorithm, and decrement b. 

(c) If b ^ Di, then return Di without exploring the second branch. 

(d) If b G Di, then add -<D to C(y) with a dependency set Di \ {b} and return 
to step 1. 

If the search returns {0}, then a successful expansion was discovered and 
the algorithm returns “satisfiable” , otherwise all possible expansions led to 
a clash and “unsatisfiable” is returned. 

Let us consider the earlier example and suppose that 3R.(A n B ) has a 
dependency set D,, MR.^A has a dependency set Dj and b = k (meaning 
that there have already been k — 1 branching points in the search tree). Note 
that the largest values in D; and Dj must be less than k, as neither concept 
can depend on a branching point that has not yet been reached. 

At the fcth branching point, C\ is added to C(x) with a dependency set 
{&} and b is incremented. The search continues in the same way until the 
(k + n — l)th branching point, when C n is added to jC(x) with a dependency 
set {k + n — 1}. Next, 3R.(A n B) is deterministically expanded, generating 
an i?-successor y with R = (x, y) labeled D; and (An B) G C(y) labeled Dp 
Finally, MR.^A is deterministically expanded, adding -> A to C(y) with a label 
D.j U Dj (because it depends on both \/R.->A G C(x) and R = (x,y)). 

The expansion now continues with C(y), and (A n B ) is deterministically 
expanded, adding A and B to C(y), both labeled Dp This results in a clash 
as {A, -i A} C C(y), and the set D ; : U Dj U Dj = Dj U D j (the union of the 
dependency sets from the two clashing concepts) is returned. The algorithm 
will then backtrack through each of the preceding n branching points without 
exploring the second branches, because in each case b ^ Dj U Dj (remember 
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that the largest values in D,; and D ; are less than k), and will continue 
to backtrack until it reaches the branching point equal to the maximum 
value in D, U Dj (if D; = Dj = 0, then the algorithm will backtrack through 
all branching points and return “unsatisfiable” ) . Figure 9.9 illustrates the 
pruned search tree, with the number of i?-successors explored being reduced 
by 2 n - 1. 

Backjumping can also be used with syntactic branching, but the procedure 
is slightly more complex as there may be more than two possible choices at 
a given branching point, and the dependency set of the disjunction being 
expanded must also be taken into account. When expanding a disjunction of 
size n with a dependency set D^, the first n — 1 disjuncts are treated like the 
first branch in the semantic branching algorithm, an immediate backtrack 
occurring if the recursive search discovers a clash that does not depend on h. 
If each of these branches returns a dependency set Dj such that b £ D;, then 
the nth disjunct is added with a dependency set (Di U • • • U D„_i U D^) \ b. 
The advantages of backjumping are: 

• It can lead to a dramatic reduction in the size of the search tree and thus a 
huge performance improvement. For example, when trying to classify the Galen 
model using either FaCT or Dlp with backjumping disabled, single satisfiability 
tests were encountered that could not be solved even after several weeks of CPU 
time. 

• The size of the search space can never be increased. 

The disadvantage is the overhead of propagating and storing the depen- 
dency sets. The storage overhead can be alleviated to some extent by using 
a pointer-based implementation so that propagating a dependency set only 
requires the copying of a pointer. A simpler scheme using single maximal 
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dependency values instead of sets would also be possible, but some depen- 
dency information would be lost and this could lead to less efficient pruning 
of the search tree. 


9. 5-4-4 Heuristic guided search 

Heuristic techniques can be used to guide the search in a way that tries to 
minimize the size of the search tree. A method that is widely used in DPLL 
SAT algorithms is to branch on the disjunct that has the Maximum number 
of Occurrences in disjunctions of Minimum Size - the well-known MOMS 
heuristic [Freeman, 1995]. By choosing a disjunct that occurs frequently in 
small disjunctions, the MOMS heuristic tries to maximize the effect of BCP. 
For example, if the label of a node x contains the unexpanded disjunctions 
CUdi,...,CU D n , then branching on C leads to their deterministic ex- 
pansion in a single step: when C is added to C{x), no further expansion of 
the disjunctions is required (they are all fully expanded) and when -> C is 
added to C(x), BCP will expand all of the disjunctions, causing D i, . . . , D n 
to be added to C(x). Branching first on any of D i, . . . , D n , on the other 
hand, would only cause a single disjunction to be expanded. 

The MOMS value for a candidate concept C is computed simply by count- 
ing the number of times C or its negation occur in minimally sized disjunc- 
tions. There are several variants of this heuristic, including the heuristic 
from [Jeroslow and Wang, 1990]. The Jeroslow-Wang heuristic considers all 
occurrences of a disjunct, weighting them according to the size of the dis- 
junction in which they occur. The heuristic then selects the disjunct with 
the highest overall weighting, again with the objective of maximizing the 
effect of BCP and reducing the size of the search tree. 

When a disjunct C has been selected from the disjunctions in £(x), a 
BCP maximizing heuristic can also be used to determine the order in which 
the two possible branches, C(x) U {C} and C(x) U {-i(7}, are explored. This 
is done by separating the two components of the heuristic weighting con- 
tributed by occurrences of C and ->C, trying C(x) U {C} first if C made the 
smallest contribution, and trying C(x) U { _, C'} first otherwise. The intention 
is to prune the search tree by maximizing BCP in the first branch. 

Unfortunately, MOMS-style heuristics can interact adversely with the 
backjumping optimization because they do not take dependency informa- 
tion into account. This was first discovered in the FaCT system, when it was 
noticed that using MOMS heuristic often led to much worse performance. 
The cause of this phenomenon turned out to be the fact that, without the 
heuristic, the data structures used in the implementation naturally led to 
“older” disjunctions (those dependent on earlier branching points) being 
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expanded before “newer” ones, and this led to more effective pruning if a 
clash was discovered. Using the heuristic disturbed this ordering and reduced 
the effectiveness of backjumping [Horrocks, 1997b]. 

Moreover, MOMS-style heuristics are of little value themselves in DL sys- 
tems because they rely for their effectiveness on finding the same disjuncts 
recurring in multiple unexpanded disjunctions: this is likely in hard proposi- 
tional problems, where the disjuncts are propositional variables, and where 
the number of different variables is usually small compared to the number of 
disjunctive clauses (otherwise problems would, in general, be trivially satis- 
fiable); it is unlikely in concept satisfiability problems, where the disjuncts 
are (possibly non-atomic) concepts, and where the number of different 
concepts is usually large compared to the number of disjunctive clauses. As 
a result, these heuristics will often discover that all disjuncts have similar 
or equal priorities, and the guidance they provide is not particularly useful. 

An alternative strategy is to employ an oldest-first heuristic that tries 
to maximize the effectiveness of backjumping by using dependency sets to 
guide the expansion [Horrocks and Patel-Schneider, 1999]. When choosing 
a disjunct on which to branch, the heuristic first selects those disjunctions 
that depend on the least recent branching points (i.e., those with minimal 
maximum values in their dependency sets), and then selects a disjunct from 
one of these disjunctions. This can be combined with the use of a BCP 
maximizing heuristic, such as the Jeroslow-Wang heuristic, to select the 
disjunct from amongst the selected disjunctions. 

Although the BCP and backjumping maximizing heuristics described 
above have been designed with semantic branching in mind they can also 
be used with syntactic branching. The oldest-first heuristic actually selects 
disjunctions rather than disjuncts, and is thus a natural candidate for a 
syntactic branching heuristic. BCP maximizing heuristics could also be 
adapted for use with syntactic branching, for example by first evaluating 
the weighting of each disjunct and then selecting the disjunction whose 
disjuncts have the highest average, median or maximum weightings. 

The oldest-first heuristic can also be used to advantage when selecting 
the order in which existential role restrictions, and the labels of the R- 
successors which they generate, are expanded. One possible technique is to 
use the heuristic to select an unexpanded existential role restriction 3R.C 
from the label of a node x, apply the 3-rule and the V-rule as necessary, 
and expand the label of the resulting .R-successor. If the expansion results 
in a clash, then the algorithm will backtrack; if it does not, then continue 
selecting and expanding existential role restrictions from jC(x) until it is 
fully expanded. A better technique is to first apply the 3-rule and the V-rule 
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exhaustively, creating a set of successor nodes. The order in which to expand 
these successors can then be based on the minimal maximum values in the 
dependency sets of all the concepts in their label, some of which may be due 
to universal role restrictions in C{x). 

The advantages of using heuristics are: 

• They can be used to complement other optimizations. The MOMS and Jeroslow- 
Wang heuristics, for example, are designed to increase the effectiveness of BCP 
while the oldest-first heuristic is designed to increase the effectiveness of back- 
jumping. 

• They can be selected and tuned to take advantage of the kinds of problem that are 
to be solved (if this is known). The BCP maximization heuristics, for example, are 
generally quite effective with large randomly generated and hand crafted prob- 
lems, whereas the oldest-first heuristic seems to be more effective when classifying 
realistic KBs. 

The disadvantages are: 

• They can add a significant overhead as the heuristic function may be expensive 
to evaluate and may need to be re-evaluated at each branching point. 

• They may not improve performance, and may significantly degrade it. 

— Heuristics can interact adversely with other optimizations, as was the case with 
the MOMS heuristic and backjumping in the FaCT system. 

— When they work badly, heuristics can increase the frequency with which patho- 
logical worst cases can be expected to occur. For example, with problems that 
are highly disjunctive but relatively under-constrained, using a BCP maxi- 
mizing heuristic to select highly constraining disjuncts can force backtracking 
search to be performed when most random branching choices would lead rapidly 
to a clash- free expansion. 

— The cost of computing the heuristic function can outweigh the benefit (if any) . 

• Heuristics designed to work well with purely proposition reasoning, such as the 
BCP maximizing heuristics, may not be particularly effective with Description 
Logics, where much of the reasoning is modal (it involves roles and subprob- 
lems). There has been little work on finding good heuristics for modal reasoning 
problems. 


9. 5-4 -5 Caching satisfiability status 

During a satisfiability check there may be many successor nodes created. 
Some of these nodes can be very similar, particularly as the labels of the 
i?-successors for a node x each contain the same concepts derived from 
the universal role restrictions in C{x). Considerable time can thus be spent 
re-conrputing the satisfiability of nodes that have the same label. As the sat- 
isfiability algorithm only needs to know whether a node is satisfiable or not, 
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this time is wasted. Moreover, when classifying a KB, similar satisfiability 
tests may be performed many times, and may provide further opportunities 
for the re-use of satisfiability results for node labels if these are retained 
across multiple concept satisfiability tests. 

If the expansion of existential value restrictions in the label of a node x is 
delayed until all other expansion possibilities have been exhausted (as in the 
trace technique), then as each existential role restriction 3R.C is expanded it 
is possible to generate the complete set of concepts that constitute the initial 
label of the i?-successor; this will consist of C plus all the concepts derived 
from universal role restrictions in C(x).‘ If there exists another node with 
the same set of initial concepts, then the two nodes will have the same satis- 
fiability status. Work need be done only on one of the two nodes, potentially 
saving a considerable amount of processing, as not only is the work at one 
of the nodes saved, but also the work at any of the successors of this node. 

Care must be taken when using caching in conjunction with blocking as 
the satisfiability status of blocked nodes is not completely determined but is 
simply taken to be equal to that of the blocking node. Another problem with 
caching is that the dependency information required for backjumping cannot 
be effectively calculated for nodes that are found to be unsatisfiable as a 
result of a cache lookup. Although the set of concepts in the initial label of 
such a node is the same as that of the expanded node whose (un)satisfiability 
status has been cached, the dependency sets attached to the concepts that 
made up the two labels may not be the same. However, a weaker form of 
backjumping can still be performed by taking the dependency set of the 
unsatisfiable node to be the union of the dependency sets from the concepts 
in its label. 

A general procedure for using caching when expanding a node x can be 
described as follows: 

1. Exhaustively perform all local expansions, backtracking as required, until only 
existential value restrictions (if any) remain to be expanded. 

2. If there are no unexpanded existential value restrictions in C{x), then return 
the satisfiability status satisfiable to the predecessor node. 

3. Select (heuristically) an unexpanded existential role restriction from C{x), ex- 
panding it and any applicable universal role restrictions to create a new node 
y with an initial label C(y) (or create all such nodes and heuristically select 
the order in which they are to be examined) . 

4. If y is blocked, then its satisfiability status S is directly determined by the 
algorithm (normally satisfiable , but may depend on the kind of cycle that has 

7 This ordering is used in the trace technique to minimize space usage, and may be useful or 

even required for effective blocking. 
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been detected [Baader, 1991]). 

(a) If 5 1 = satisfiable, then return to step 2 without expanding C(y). 

(b) If S = unsatisfiable, then backtrack without expanding C(y). The depen- 
dency set will need to be determined by the blocking algorithm. 

5. If a set equal to C(y) is found in the cache, then retrieve the associated satis- 
fiability status S (this is called a cache “hit”). 

(a) If S = satisfiable, then return to step 2 without expanding £(y). 

(b) If S' = unsatisfiable, then backtrack without expanding C{y), taking the 
dependency set to be the union of the dependency sets attached to the 
concepts in C(y). 

6. If a set equal to C(y) is not found in the cache, then set L = C{y) and expand 

C(y) in order to determine its satisfiability status S. 

(a) If S = satisfiable and there is no descendant 2 of y that is blocked by an 
ancestor x' of y, then add L to the cache with satisfiability status S and 
return to step 2. 

(b) If S = satisfiable and there is a descendant 2 of y that is blocked by an 
ancestor x' of y, then return to step 2 without updating the cache. 

(c) If S = unsatisfiable, then add L to the cache with satisfiability status S 
and backtrack, taking the dependency set to be the one returned by the 
expansion of C{y). 

The problem of combining caching and blocking can be dealt with in 
a more sophisticated way by allowing the cached satisfiability status of a 
node to assume values such as “unknown" . These values can be updated as 
the expansion progresses and the satisfiability status of blocking nodes is 
determined. Such a strategy is implemented in the Dlp system. 

A further refinement is to use subset and superset instead of equality 
when retrieving satisfiability status from the cache: if £(x) is satisfiable, 
then clearly any £(y) C C(x) is also satisfiable, and if £(x) is unsatisfiable, 
then clearly any C(y) A £{x) is also unsatisfiable [Hoffmann and Koehler, 
1999; Giunchiglia and Tacchella, 2000]. However, using subsets and supersets 
significantly increases the complexity of the cache, and it is not yet clear 
if the performance cost of this added complexity will be justified by the 
possible increase in cache hits. 

The advantages of caching the satisfiability status are: 

• It can be highly effective with some problems, particularly those with a repeti- 
tive structure. For example, the Dlp system has been used to demonstrate that 
some of the problem sets from the Tableaux’98 benchmark suite are trivial when 
caching is used (all problems were solved in less than 0.1 s and there was little ev- 
idence of increasing difficulty with increasing problem size). Without caching, the 
same problems demonstrated a clearly exponential growth in solution time with 
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increasing problem size, and the system was unable to solve the larger problems 
within the 100 s time limit imposed in the test [Horrocks and Patel-Schneider, 
1999], 

• It can be effective with both single satisfiability tests and across multiple tests 
(as in KB classification). 

• It can be effective with both satisfiable and unsatisfiable problems, unlike many 
other optimization techniques that are primarily aimed at speeding up the detec- 
tion of unsatisfiability. 

The disadvantages are: 

• Retaining node labels and their satisfiability status throughout a satisfiability test 
(or longer, if the results are to be used in later satisfiability tests) involves a storage 
overhead. As the maximum number of different possible node labels is exponential 
in the number of different concepts, this overhead could be prohibitive, and it 
may be necessary to implement a mechanism for clearing some or all of the cache. 
However, experiments with caching in the Dlp system suggest that this is unlikely 
to be a problem in realistic applications [Horrocks and Patel-Schneider, 1999] . 

• The adverse interaction with dependency directed backtracking can degrade per- 
formance in some circumstances. 

• Its effectiveness is problem dependent, and (as might be expected) is most evident 
with artificial problems having a repetitive structure. It is highly effective with 
some of the hand crafted problems from the Tableaux’98 benchmark suite, it is 
less effective with realistic classification problems, and it is almost completely in- 
effective with randomly generated problems [Horrocks and Patel-Schneider, 1999]. 

• The technique described depends on the logic having the property that the sat- 
isfiability of a node is completely determined by its initial label set. Extending 
the technique to logics that do not have this property, for example those which 
support inverse roles, may involve a considerable increase in both complexity and 
storage requirements. 


9.6 Discussion 

To be useful in realistic applications, DL systems need both expressive log- 
ics and fast reasoners. Procedures for deciding subsumption (or equivalently 
satisfiability) in such logics have discouragingly high worst-case complex- 
ities, normally exponential with respect to problem size. In spite of this, 
implemented DL systems have demonstrated that acceptable performance 
can be achieved with the kinds of problem that typically occur in realistic 
applications. 

This performance has been achieved through the use of optimization tech- 
niques, a wide variety of which have been studied in this chapter. These 
techniques can operate at every level of a DL system; they can simplify the 
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KB, reduce the number of subsumption tests required to classify it, replace 
tableau subsumption tests with less costly tests, and reduce the size of the 
search space resulting from non-deterministic tableau expansion. Amongst 
the most effective of these optimizations are absorption and backjumping; 
both have the desirable properties that they impose a very small additional 
overhead, can dramatically improve typical case performance, and hardly 
ever degrade performance (to any significant extent). Other widely applica- 
ble optimizations include enhanced traversal, normalization, lazy unfolding, 
semantic branching and local simplification; their effects are less general and 
less dramatic, but they too impose low overheads and rarely degrade per- 
formance. Various forms of caching can also be highly effective, but they 
do impose a significant additional overhead in terms of memory usage, and 
can sometimes degrade performance. Finally, heuristic techniques, at least 
those currently available, are not particularly effective and can often degrade 
performance. 

Several exciting new application areas are opening up for very expres- 
sive Description Logics, in particular reasoning about database schemata 
and queries, and providing reasoning support for the Semantic Web. These 
applications require logics even more expressive than those implemented 
in existing systems, in particular logics that include both inverse roles and 
number restrictions, as well as reasoning with general axioms. The challenge 
for DL implementors is to demonstrate that highly optimized reasoners can 
provide acceptable performance even for these logics. This may require the 
extension and refinement of existing techniques, or even the development of 
completely new ones. 

One promising possibility is to use a more sophisticated form of depen- 
dency directed backtracking, called dynamic backtracking [Ginsberg, 1993] , 
that preserves as much work as possible while backtracking to the source of 
a contradiction. Another useful approach, indicative of the increasing matu- 
rity of existing implementations, is to focus on problematical constructors 
and devise methods for dealing with them more efficiently. Good examples 
of this can be seen in the Racer system, where significant improvements 
in performance have been achieved by using more sophisticated techniques 
to deal with domain and range constraints on roles (see Chapter 2 for an 
explanation of these constructs) and qualified number restrictions [Haarslev 
and Moller, 2001c; 2001d; 2001a]. 

Finally, it should be re-emphasized that, given the immutability of theo- 
retical complexity, no (complete) implementation can guarantee to provide 
good performance in all cases. The objective of optimized implementations 
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is to provide acceptable performance in typical applications and, as the def- 
inition of “acceptable” and “typical” will always be application dependent, 
their effectiveness can only be assessed by empirical testing. Hopefully, the 
new generation of highly optimized DL systems will demonstrate their ef- 
fectiveness by finding more widespread use in applications than did their 
predecessors. 




Part III 

Applications 
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Conceptual Modeling with Description Logics 

Alex Borgida 
Ronald J. Brachman 


Abstract 

The purpose of the chapter is to help someone familiar with DLs to un- 
derstand the issues involved in developing an ontology for some universe of 
discourse, which is to become a conceptual model or knowledge base repre- 
sented and reasoned about using Description Logics. 

We briefly review the purposes and history of conceptual modeling, and 
then use the domain of a university library to illustrate an approach to 
conceptual modeling that combines general ideas of object-centered model- 
ing with a look at special mo deling/ ontological problems, and DL-specific 
solutions to them. 

Among the ontological issues considered are the nature of individuals, 
concept specialization, non-binary relationships, materialization, aspects of 
part-whole relationships, and epistemic aspects of individual knowledge. 


10.1 Background 

Information modeling is concerned with the construction of computer-based 
symbol structures that model some part of the real world. We refer to such 
symbol structures as information bases, generalizing the term from related 
terms in Computer Science, such as databases and knowledge bases. More- 
over, we shall refer to the part of a real world being modeled by an informa- 
tion base as its universe of discourse (UofD). The information base is checked 
for consistency, and sometimes queried and updated through special-purpose 
languages. As with all models, the advantage of information models is that 
they abstract away irrelevant details, and allow more efficient examination 
of both the current, as well as past and projected future states of the UofD. 

An information model is built up using some language, and this language 
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influences (more or less subtly) the kinds of details that are considered. For 
example, early information models (e.g., relational data models) were built 
on conventional programming notions such as records, and as a result fo- 
cused on the implementation aspects of the information being captured, as 
opposed to the representational aspects. Conceptual models offer more ex- 
pressive facilities for modeling applications directly and naturally [Hammer 
and McLeod, 1981], and for structuring information bases. These languages 
provide semantic terms for modeling an application, such as entity and re- 
lationship (or even activity, agent and goal), as well as means for organizing 
information. 

Conceptual models play an important part in a variety of areas. The 
following is a brief summary of these areas, as reviewed in [Mylopoulos, 
1998]: 

• Artificial intelligence programs turned out to require the representation of a great 
deal of human knowledge in order to act “intelligently” . As a result, they relied 
on conceptual models built up using knowledge representation languages, such 
as semantic networks - directed graphs labeled with natural language identifiers. 
DLs are the historical descendants of attempts to formalize semantic networks. 

• The design of database systems was seen to have as an important initial phase the 
construction of a “ conceptual level schema" , which determined the information 
needs of the users, and which was eventually converted to a physical implementa- 
tion schema. Chen’s Entity-Relationship model [Chen, 1976], and later semantic 
data models [Hull and King, 1987], were the result of efforts in this direction. 

• More generally, the development of all software has an initial requirements ac- 
quisition stage, which nowadays is seen to consist of a requirements model that 
describes the relationship of the proposed system and its environment. The envi- 
ronment in this case is likely to be a conceptual model. 

• Independently, the object-oriented software community has also proposed view- 
ing software components (classes/objects) as models of real-world entities. This 
was evident in the features of Simula, the first object-oriented programming lan- 
guage, and became a cornerstone of most object-oriented techniques, including 
the current leader, UML [Rumbaugh et al., 1998]. 

One interesting aspect of conceptual modeling in the database context 
has been the identification of a number of abstraction mechanisms that 
support the development of large models by abstracting details initially, 
and then introducing them in a stepwise and systematic manner. Among 
the important abstractions are the following: 

• thinking of objects as wholes, not just a collection of their attributes/components 
(“aggregation”); 
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• abstracting away the detailed differences between individuals, so that a class can 
represent the commonalities (“classification” 1 ); 

• abstracting the commonalities of several classes into a superclass (“generaliza- 
tion” ) . 

An important claim regarding the benefits of abstraction in conceptual 
modeling is that it results in a structured information model, which is easier 
to build and maintain. Interestingly, Description Logics further this goal by 
supporting the automatic classification of concepts with respect to others, 
thereby revealing generalizations that may not have been recognized by the 
modeler. 


10.2 Elementary Description Logic modeling 

Most conceptual models, including Description Logics, subscribe to an 
object- centered view of the world. Thus, their ontology includes notions like 
individual objects, which are associated with each other through (usually 
binary) relationships, and which are grouped into classes. In this chapter we 
use freely the notation and concrete syntax of Description Logics (see Ap- 
pendix), and extend it with additional constructs that make it more suitable 
for modeling. 

In the domain of a university library, we might encounter a particular 
person, GIANNI, or a particular book, BOOK23. Most of the information 
about the state of the world is captured by the interrelationships between 
individuals, such as GIANNI having borrowed BOOK23. Binary relationships 
are modeled directly in Description Logics using roles and attributes: either 
GIANNI is a filler of the lentTo role for BOOK23, or BOOK23 is the filler of 
the hasBorrowed role for GIANNI. Note that lentTo and hasBorrowed are con- 
verse relationships, and this should be captured in a model, since frequently 
one wants to access information about associations in either direction. In 
Description Logics, this is accomplished using the role constructor inverse: 

hasBorrowed = (inverse lentTo). 

Note that in order to avoid inadvertent errors during modeling due to 
confusion between a role and its converse, or between a role and the kind 
of values filling it, one heuristic is to use a natural language name that is 
asymmetric, and adopt the convention that the relationship R(a, b ) should be 
read as “a R b”; therefore in the above case lentTo(BOOK23, GIANNI) reads 

1 This term is used in a completely different way than in DL terminology, where it refers to the 

DL-KBMS service of finding the lowest subsumers of a concept or individual. 
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“BOOK23 lent To GIANNI,” while lentTo(GIANNI,BOOK23) reads “GIANNI 
lent To BOOK23”, which makes it clear that the first, but not the sec- 
ond, is the proper way to use the role lentTo in the model. On the other 
hand, loan would be a poor choice of a role identifier because one could 
equally well imagine loan as a role of books or of persons, so that neither 
loan(GIANNI,BOOK23) nor loan(BOOK23, GIANNI) “reads” properly. 

In addition, it is always important to distinguish functional relationships, 
like lentTo (a book can be loaned to at most one borrower at any time) 
from non-functional ones, like hasBorrowed. This is done most cleanly if 
the particular Description Logic being used allows the declaration of func- 
tional relationships, sometimes called “attributes” or “features”. Attributes 
themselves come in two flavors: total and partial. Thus lentTo is a partial 
attribute because a book can only be loaned to one person, but may not be 
on loan at some point of time; on the other hand, every book has to have an 
isbnNr. It is important to check which interpretation of attributes is offered 
by the particular Description Logic being used. In the rest of this chapter 
we assume that attributes are total, and the concept constructor the will be 
used as an abbreviation, so that (the p C) is equivalent to the conjunction 
of (all p C), (at-most 1 p) and (at-least 1 p). 

Individuals are grouped into classes; for example, Book might be a nat- 
ural class in our domain. Classes usually abstract out common properties 
of their instances, e.g., every book in the library has a call number. Classes 
are modeled by concepts in Description Logics, and usually the common 
properties are expressed as subsumption axioms about the concept. These 
conditions usually involve superconcepts, as well as the kinds of values that 
can fill roles, and limits on the number of (various kinds of) role fillers. By 
design, these are exactly the kinds of things that can be expressed using DL 
constructors: 


/* Books are materials, whose callNr is an integer */ 

Book C (and Material 

(the callNr Integer) 

...). 

As mentioned in earlier chapters, one of the fundamental properties of 
Description Logics is support for the distinction between primitive/atomic 
concepts - for which instances can only be declared explicitly - and defined 
concepts - which offer necessary and sufficient conditions for membership. 
So, for example, we can distinguish the notion of “borrower” as someone 
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who can borrow a book (an approved customer of the library) 

/* Borrower is previously declared as a primitive concept. 

Here it is indicated what restrictions on borrowing are in force for this concept */ 

Borrower C (all hasBorrowed Book) 

from the notion of “borrower” as someone who has actually borrowed a book 
from the library 

/* Borrower is defined as someone who has borrowed books */ 

Borrower = (and (all hasBorrowed Book) 

(at-least 1 hasBorrowed)). 


We now turn to a variety of more subtle issues that arise when modeling 
a domain. Almost all of these issues arise independently of the modeling 
language used; what we emphasize here is the range of possible solutions in 
the DL framework. 


10.3 Individuals in the world 

Some individuals are quite concrete, like a particular person, Gianni, or a 
particular copy of a book. Some are more abstract, like the subject matter 
covered by a book. The important property of most individuals is that they 
have an identity , which allows them to be distinguished from one another 
and to be counted. 

Modeling of individuals is therefore made easier if they have unique iden- 
tifiers. Unfortunately, this may not always be the case. For example, if one 
sees on a bookshelf two brand new copies of a book, which may not be 
distinguishable by any property known to us, one can still say that they 
are different copies of the book. In information management systems, and 
sometimes in the real world, this leads us to devise some kind of “extrinsic” 
identification scheme. For example, books on the library shelf are assigned a 
copy number. In this paper, as in object-oriented software systems, we will 
tend to assign arbitrary internal identifiers to objects, such as GIANNI or 
BOOK23. 

The following examples concerning books show that what constitutes a 
relevant individual in a UofD depends very much on what we want to do 
with the information. In a domain concerning literature courses, one might 
consider something like Dickens’ HARD-TIMES as the kind of individual 
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appearing on an assigned reading list. For an Internet book-seller interface, 
it is necessary to consider a more concrete level of modeling - that of book 
editions, since these may have different prices. Finally, in a library, we need 
to keep track of actual physical book copies. 

In the last two cases, one must then decide whether to model books (as 
opposed to editions or copies) as individuals, or as concepts that have the 
other kinds of individuals as instances. A general heuristic is that if we expect 
certain notions to be counted, then they must be modeled as individuals. 
Another heuristic is that notions that do not have an inception time are 
usually modeled as concepts. 

Modeling of the particular kind of relationship that exists, for example, 
between a book and its editions is further examined in Subsection 10.7.2. 


10.3.1 Values vs. objects 

It is important to distinguish what we may call individual objects , such as 
GIANNI, from values, such as integers, strings, lists, tuples, etc. The former 
have an associated intrinsic and immutable identity, and need to be created 
in the knowledge base. The later are “eternal” mathematical abstractions, 
whose identity is determined by some procedure usually involving the struc- 
ture of the individual. For example, the two strings “abc” and “abc” are 
the same individual value because they have the same sequence of charac- 
ters; similarly for dates, such as 1925/12/20, which can be considered as 
3-tuples. 

Many Description Logics only support reasoning with objects, in which 
case composite values such as dates need to be modeled as objects with 
attributes for day, month and year. The danger here is that, for example, 
multiple date individuals can be created with the same attribute values, in 
which case they are treated as distinct for the purposes of counting and 
identity checking, resulting in reasoning anomalies. Implemented Descrip- 
tion Logics such as Classic support values from the underlying program- 
ming language (so-called “host values” ) , and relatively simple concept hier- 
archies over them. Others, such as ACC{T>) [Baader and Hanschke, 1991a] 
and STLOQ{T>) [Horrocks and Sattler, 2001] allow attributes to have values 
from so-called “concrete domains” , which can contain entirely new kinds of 
values. These concrete domains are required to have their own, independent 
reasoners, which are then coupled with the DL reasoner. 

Equally desirable would be mathematical types such as sets, bags, se- 
quences, and tuples, as supported by modern programming languages and 
certain semantic data models. 
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Currently, only the highly expressive VCR languages support notions such 
as n-tuples and recursive fixed-point structures, from which one can build 
lists, trees, etc. Even here, one can only provide the description of concepts 
(“list of Persons”), as opposed to the specification of individuals (“the list 

[GIANNI, ANNA]”). 


10.3.2 Individuals vs. references to them 

It is important to distinguish an individual from various references to it: 
Gianni vs. “the person whose first name is the 6-letter string “Gianni” vs. 
“the borrower with library card number 32245” vs. “the chairman of the 
Psychology Department” . This distinction becomes crucial when we express 
relationships: there is a difference between relating two objects and relating 
their names, because we usually want objects to remain related, even if 
names are changed. Thus “GIANNI hasBorrowed BOOK25” is different from 
“card-holder number 32245 hasBorrowed BOOK25” , because if Gianni gets 
a new card (after losing his old one, say), then the relationship between 
Gianni and the book is lost. So, in general, one should always deal with the 
individual objects, unless there is a bijection between a class of objects and 
a class of referents to them, and this bijection is universal (it always exists) 
and is unchanging. 2 Kent [1979] has eloquently argued the importance of 
these issues in record-based database systems, and shows that in the real 
world such bijections are much rarer than assumed. For example, Neumann 
[1992] reports that the same US social security number (the prototypical 
identifier for persons in the USA) has been issued to two people, who even 
have the same name and birth-date! 

Conversely, in some cases one wants to state relationships between inten- 
sional references, rather than specific objects. For example, we might want 
to say that, in general, the director of the library is the head of the book 
selection committee (C0MMITTEE3). If Gianni happens to be the current 
director of the NBU library, then asserting head0f(GIANNI,C0MMITTEE3) 
is improper because, for example, if Gianni steps down as director, ac- 
cording to the above model he would still be committee chair. One needs 
the ability to use unnamed expressions as arguments of relationships, 
along the lines of the predicate logic expression headOf(directorOf(NBU- 
LIBRARY),C0MMITTEE3). 

In Description Logics, intensional referents can be expressed as roles that 
are applied to individuals. (The roles may often be complex chains, resulting 

2 Such bijections are exactly the “keys” used in the database context. 
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from the composition of atomic roles, as in “the zipCode of the address of 
the lentTo”.) Assuming that we use the notation NBU-LIBRARY. director to 
refer to the filler of the director role for the NBU-LIBRARY individual, the 
above relationship is actually stated as “NBU-LIBRARY. director is identical 
to COM MITTEE3. head”. The concept constructor same-as, indicating that 
two chains of roles have the same value, is used to express exactly such 
relationships, so the above situation might be modeled, naively, using the 
concept (same-as director head). The problem is that we need a single indi- 
vidual of which to assert this property, yet it is libraries that have directors 
while committees have heads. In such situations, in Description Logics one 
must find or create some chain of attributes relating the two individuals 
NBU-LIBRARY and C0MMITTEE3. The natural relationship in this case is 
the attribute hasBookSelectionCommittee. Therefore the appropriate way of 
modeling this situation is 

/* NBU-LIBRARY has book selection committee COMMITTEE3 */ 

hasBookSelectionCommittee( NBU-LIBRARY, COMMITTEE3) 

/* NBU-LIBRARY. director equals 

NBU-LIBRARY. hasBookSelectionCommittee. head */ 

(same-as director (hasBookSelectionCommittee o head))(NBU-LIBRARY). 


10.4 Concepts 

For the university library, some obvious classes of individuals include peo- 
ple, institutions, the material that can be loaned by the library, the staff, 
dates, library cards, and fines. These classes are normally modeled using 
atomic/ primitive concepts in Description Logics. 

It may be worth noting that in Description Logics the same individual may 
be an instance of multiple classes, without one being necessarily a subclass 
of another: some book might be an instance of both hardcover and science 
books. This is in contrast with many other object-oriented software systems, 
where one is forced to create a special subclass for this notion, in order to 
guarantee a unique “minimal” class for every individual. However, this is 
not a modeling principle - it is an implementation obstacle. 


10. 4-1 Essential vs. incidental properties of concepts 

As explained in the earlier example involving the two possible meanings 
for the term “borrower”, an important feature of Description Logics is the 
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ability to distinguish primitive from defined concepts, where the latter have 
necessary and sufficient conditions for concept membership. 

For example, BookOnLoan might naturally be defined as 

/* A book is on loan if it is borrowed by someone */ 

BookOnLoan = (and Book (at-least 1 lentTo)). 

Suppose that we also want to require that only hardcover books can be 
loaned out. There seem to be two options for modeling this: 

/* Option 1 - being hardcover is part of the definition */ 

BookOnLoan = (and Book 

(at-least 1 lentTo) 

(fills b inding 'hardcover)) 

/* Option 2 - being hardcover is an additional necessary condition */ 

BookOnLoan = (and Book (at-least 1 lentTo)) 

BookOnLoan C (fills binding 'hardcover). 

The first approach is not quite right because being hardcover is an inci- 
dental property of books on loan, albeit one universally shared by all such 
objects. Among other things, this means that if the system is to recognize 
some individual book as being on loan, it is enough to know that it has been 
lent to someone - one does not also need to know it is hardcover. Hence the 
second modeling option is the right one, since one can actually deduce that 
a book on loan is hardcover, if this was not known ahead of time. 

The distinction between definitional and incidental properties is also 
important if we consider the task of classifying concepts into a taxonomy, 
since it has been argued that the taxonomy should not depend on contingent 
facts. This suggests that incidental properties, even universal inclusion 
assertions like the one for hardcover books in Option 2 above, should 
appear in the ABox, not the TBox defining the terminology. 

Another subtle problem arises when there are multiple sufficient condi- 
tions for a concept. For example, suppose we associated a due date with 
books on loan (in the physical world, this might be recorded as a date 
stamped in the back of the book). Then encountering a book with a due 
date in the future would rightly classify it as a book on loan. If we model 
the due date as an attribute of books, which has a value only as long as the 
date is in the future, then we would represent this situation as 

(and Book (at-least 1 dueDate)) C BookOnLoan 
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and, of course, requiring books on loan to have a due date would lead to 


BookOnLoan C (at-least 1 dueDate). 

We thus have multiple sufficient conditions for being a book on loan, al- 
though one of them appears to be the primary definition. 


10. 4 -2 Reified concepts and meta-roles 

In some cases it seems natural to associate information with an entire 
concept, rather than with each of its individual instances. One situation 
where this arises is in capturing aggregate information, such as the count 
of current individual instances of the concept, or the average value of their 
attributes. In the library example, attributes such as numberOfBooks and 
mostRequested Books would fall into this category. 

In some object-oriented systems this can be modeled directly because 
classes are themselves objects, and as such are instances of nreta-classes 
and have nreta-properties. Currently, Description Logics do not have a 
facility to treat classes as objects. One must therefore create a separate 
“meta-individual” that is related to the concept by some naming conven- 
tion, for example. In our example, we would create the individual BOOK- 
CLASS-OBJECT, and then attach the information regarding numberOfBooks, 
mostRequested Books, etc., as roles of this individual. In the Classic system, 
given a named concept, this meta-individual can be retrieved using a special, 
new knowledge base operation. 


10.4-3 Concepts dependent on relationships 

The following interesting modeling problem arises in many situations: some 
concepts, such as Book, stand on their own. Others, such as Borrower, rely 
on the implied existence of some relation /event (e.g., lending), which has 
a second argument, and from which their meaning is derived. It is impor- 
tant to discern this second category of concepts, and explicitly introduce 
the corresponding binary relationship in the model. In the data modeling 
literature (e.g., [Albano et al . , 1993]) categories of this second type, such 
as Borrower, are called “roles”, but to avoid confusion with DL roles, we 
will call them “ relationship roles” . The modeling of these will be considered 
further in Subsection 10.7.1. 
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10.5 Subconcepts 

For many of the above concepts, there are specialized subconcepts represent- 
ing subsets of individuals that are also of interest. For example, the concept 
Material (referring to the holdings of libraries) could have specializations 
Book, Journal, Videotape, etc. In turn, Book may have subconcepts Mono- 
graph, EditedCollection, Proceedings, etc . 3 And Borrowers may be Institutions 
or Individuals, with the latter being divided into Faculty, Student, Staff. 

There are a number of special aspects of the subclass relationship that 
should be modeled in order to properly capture the semantics of the UofD. 


10.5.1 Disjointness of subconcepts 

In many cases, subclasses are disjoint from each other. For example, Book 
and Journal are disjoint subclasses of Material. In Description Logics that 
support negation, this is modeled by adding the complement of one concept 
to the necessary properties of the other concept: 

Book C not Journal 

Often, entire collections of subclasses are disjoint . 4 For this purpose, some 
Description Logics provide the ability to describe disjointness by naming a 
discriminator, and a special declaration operation for primitive subclasses. 
For example, one might discriminate between various kinds of material on 
the basis of the medium as follows: 

Print C (disjointPrim Material in group medium with discriminant paper) 
Video C (disjointPrim Material in group medium with discriminant light) 
Audio C (disjointPrim Material in group medium with discriminant sound) 

At the same time, one might discriminate between different kinds of ma- 
terial on the basis of the format: 

Book C (disjointPrim Material in group format with discriminant book) 
Journal C (disjointPrim Material in group format with discriminant journal) 


Two points are worth making here: (i) the advantage of a syntax based 
on discriminators is that it avoids the multiplicative effect of having to state 

3 For this section, we will think of the material to be loaned as physical individuals that can be 
carried out the door of the library, so to speak. 

4 This is especially the case at the top of the subclass hierarchy: Person, Material, etc. 
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disjointness for every pair of disjoint concepts; (ii) as in the above example, 
it is important to allow during modeling for multiple groups of disjoint 
subconcepts for the same concept. 


10.5.2 Covering by subconcepts 

In addition to disjointness, it is natural to consider whether some set of 
subclasses fully covers the superclass. For example, we might want to say 
that Circulating material must be either short-term or long-term. 

For Description Logics that support concept disjunction, this is easy: 

Circulating C (or ShortTerm LongTerm). 

Note that since ShortTerm, in turn, has Circulating as a superclass, the 
possibility arises of modeling Circulating as a definition: 

Circulating = (or ShortTerm LongTerm). 

However, this approach is not available for languages like Classic, which 
avoid disjunction in order to gain tractable reasoning. We discuss in the 
next section an approach to the problem based on subconcept definitions 
and enumerated values. 


10.5.3 Defined vs. primitive subconcepts 

In the case of material that is either circulating or non-circulating, the name 
of the second class provides a hint: after introducing Material and Circulating 
as primitives, NonCirculating should be defined: 

Circulating C Material 

NonCirculating = (and Material (not Circulating)). 


In this case, the Description Logic can deduce both the disjointness of Cir- 
culating and NonCirculating, and the fact that Material is the union of Circu- 
lating and NonCirculating, without having stated anything explicitly about 
either. This shows clearly the power of a reasoning system that is capable 
of supporting definitions. 

By joining covering and disjointness one gets the partitioning of a class by 
some group of subclasses. In some Description Logics - those supporting the 
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constructor one- of - it is possible to simulate the effect of declaring concepts 
as partitioned into subconcepts through the use of a special attribute. For 
example, we could add the attribute format to Books, with an enumerated 
set of possible values: 

Book C (the format (one-of ’monograph ’journal ’ editedCollection)) 

and then define the corresponding subclasses: 

Monograph = (and Book (fills format ’monograph)) 

Journal = (and Book (fills format ’journal)) 

EditedCollection = (and Book (fills format ’ editedCollection)). 

These concepts will be disjoint because format can have at most one value, 

and they cover the original class Book, because format must have (at least) 
one value from among the set enumerated. 


10.5.4 Dynamics of (sub) concept membership 

When changes in the model are allowed, there is a distinction between con- 
cepts that represent inherent properties of objects that do not change over 
time (called “rigid” in [Guarino and Welty, 2000]) such as Book, and con- 
cepts that represent more transient properties, such as Misplaced Book. Note 
that while it is possible for a transient property to be a subconcept of rigid 
one, the converse does not make sense. 

Standard Description Logics have not developed modeling tools for issues 
involving the dynamics of the world, and hence usually cannot represent 
such distinctions. Description Logics extended with the notion of time, such 
as [Artale and Franconi, 1998], are of course well suited to express them. 


10.5.5 The structure of the subconcept hierarchy 

Recent work by Guarino and Welty (e.g., [Guarino and Welty, 2000]) has 
presented several interesting ontological dimensions along which a concept 
can be positioned. 

The dimensions are related to many of the topics we discuss elsewhere 
in this chapter, including the existence or absence of criteria for identifying 
individuals (Section 10.3), the rigid vs. non-rigid nature of concept member- 
ship (Subsection 10.5.4), the nature of the part-whole relationship (Subsec- 
tion 10.7.3), and aspects resembling relationship roles (Subsection 10.7.1). 
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The significance of these dimensions is that they can be used to both clar- 
ify the intended meaning of concepts in an ontology, and to better organize 
the taxonomy of primitive concepts. The conditions for proper taxonomies 
are based on observations such as “a concept some of whose current in- 
stances may cease to be instances at some point in the future (e.g., Student,) 
cannot subsume a concept whose membership cannot change (e.g., Person,).” 
We refer the reader to the original paper for further details. 


10.6 Modeling relationships 

As mentioned earlier, binary relationships are modeled in Description Logics 
using roles and attributes. Just as with subclasses, there are a number of 
special constraints that are frequently expressed about relationships: cardi- 
nality constraints state the minimum and maximum number of objects that 
can be related via a role; domain constraints state the kinds of objects that 
can be related via a role; and inverse relationships between roles need to be 
recorded. For example, a book has exactly one title, which is a string, and 
exactly one call number, which is some value that depends on the cataloging 
technique used. On the other hand, there may be zero or more authors for 
a book: 

Book C (and (the title String) 

(the callNr M ateria I Identifier) 

(all author Person)). 

As mentioned in Section 10.2, we can use the attribute lentTo to model when 
someone borrows a book: 

Book C (all lentTo Borrower). 

Suppose we also want to record that the material in the library may be 
on loan, available or missing. This can be modeled by adding appropriate 
roles to the library: 

Library C (and (all hasOnLoan Material) 

(all hasAvailable Material) 

(all hasMissing Material)). 

In such a case we would like to say that these roles are non-overlapping. 
This could be accomplished through the use of a concept constructor 
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non-overlapping, syntactically similar to same-as: (non-overlapping hasOn- 
Loan hasAvailable). However, if only one library is involved, it would be bet- 
ter to model the situation using an appropriate subclass of Material, such as 
MissingMaterial, because we already have tools for modeling disjointness of 
subclasses, and reasoning with them is not inherently hard as is the case of 
general constructors such as same-as and non-overlapping. 


10.6.1 Reified relationships 

It is sometimes useful to be able to give “properties of properties”. For 
example, when some material is lent to a borrower, it is useful to record on 
what date the loan took place and when the material is due back. In the 
Entity-Relationship approach this would be modeled by the creation of a 
relationship class, called Loan, which would have attributes onLoan, lentTo, 
as well as lentOn and dueOn, describing the loan. This can be thought of 
as the reification of the relationship, and results in the following DL class 
specification: 

Loan C (and (the lentTo Borrower) 

(the onLoan Material) 

(the lentOn Date) 

(the dueOn Date) 

(the NrOfRenewals (max 3))). 

Unless the DL supports n-ary relations, reified relationships become es- 
sential when modeling associations that involve more than two objects, as 
would be the case, for example, if we had several libraries (or branches), and 
we wanted to record from which library the loan was made. 

Reified relationships have the disadvantage of requiring the modeler to 
distinguish somehow the subset of attributes determining the relationship 
R(a,b,...) from those qualifying it. In the above case, we may imag- 
ine that Loan represents a binary relationship Loan(Borrower, Material) be- 
tween lentTo and onLoan (in which case lentOn is there just to qualify 
the relation); alternatively, we may interpret Loan as a ternary relation- 
ship Loan (Borrower, Material, Date) between lentTo, onLoan and lentOn. The 
former records loans (a borrower may have a book at most once) while the 
latter records the history of loans. The notion of “keys/unique identifiers” 
from databases, as adapted to Description Logics [Borgida and Weddell, 
1997] can be used for this task, by marking the collection of attributes that 
describe the relationship as a key. 
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We remark that the T>CTZ Description Logic can express n-ary relation- 
ships directly, so it does not require reification for this purpose. 


10.6.2 Role hierarchies 

In many applications, two roles on the same concept may be related by the 
constraint that every filler of the first role must be a filler of the second 
role. For example, in the library domain, the fillers of the role hasOnShort- 
TermLoan, recording a borrower’s materials that need to be returned within 
a week, are also fillers of hasBorrowed, recording all the materials borrowed 
(this would be true by definition). Similarly, the editorlnChief of a journal 
would be included in its editorialStaff. 

One of the important features of frame knowledge representation schemes, 
and Description Logics in particular, is that they encourage the modeler to 
think of roles as first-class citizens. This includes support for the notion 
of a role taxonomy (subroles). This is all the more reasonable, since once 
we reify a relationship, we would be allowed to create subconcepts of it at 
will. 

As a result, the above kinds of constraints on the containment of role fillers 
can be modeled through the use of role hierarchies - a notion supported by 
most Description Logics, at least for primitive roles: 

hasOnShortTermLoan C hasBorrowed. 


10.7 Modeling ontological aspects of relationships 

The material in this section deals with some special kinds of relationships 
and approaches to modeling them. The cognoscenti will recognize these as 
issues related to the ontological aspects of a UofD (constructs relating to 
the essence of objects), as opposed to epistemological aspects (constructs 
relating to the structure of objects), which are captured by notions such as 
InstanceOf and IS-A. The kinds of relationships to be discussed below do 
however occur relatively frequently, and pose difficulties to the uninitiated. 


10.7.1 Relationship roles 

A subtle, but important, distinction can be drawn between objects that may 
participate in a relationship (the domain restrictions on the role) and the 
objects that actually do take part in one or more relationships. For example, 
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the objects participating in a lending relationship can be said to be playing 
certain “roles”: LentObject and Borrower. It was exactly this second meaning 
of borrower - as a relationship role - that was contrasted with the original 
meaning of “potential borrower” in our example of Section 10.2. 

Description Logics allow one to define the relationship roles associated 
with a relationship. In the case when the relationship is modeled by a regular 
DL role, such as borrowed By, we can define lent objects as ones that are being 
borrowed, and borrowers as objects that are the values of borrowed By: 


LentObject = (at-least 1 borrowedBy) 

Borrower = (at-least 1 (inverse borrowedBy)). 

In the case of the reified Loan relationship, the definition of these classes 
would be 


LentObject 

Borrower 


(at-least 1 (inverse onLoan)) 
(at-least 1 (inverse lentTo)). 


10.7.2 Materialization 

There is a family of situations whose modeling is complicated by the fact 
that several concepts can be referred to by the same natural language term. 
For example, one might say “Shakespeare wrote ‘Hamlet’,” “The ‘Hamlet’ 
in London this season is a success,” and “‘Hamlet’ was canceled tonight.” 
But there is a difference between the abstract notion of the play ‘Hamlet’, 
various stagings of the play, and particular performances. Other familiar 
distinctions of this kind include the difference between an airline flight ( “Air 
France flight 25 from Paris to London” ) and a particular “instance” of it - 
the one that will leave on May 24, 2002. Failure to model such differences 
can result in the same kind of problem that arises with any other form of 
ambiguity - inappropriate use in a context. So one can only buy tickets to 
play performances, but theatrical awards are given to stagings. 

In each of these cases there is a relationship between a general notion (e.g., 
play staging) and 0-to-N more specific notions (e.g., performance of that 
play staging), which has been called materialization , and was investigated 
in [Pirotte et al., 1994], 

Let us first model some information that we would like to capture in the 
library domain: 
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/* Books have information about authors, etc. */ 

Book C (and . . . 

(all hasAuthors Person) 

(the hasTitle String)) 

/* Editions of books are related to the book (in a way yet to be specified) 
but have their own roles too */ 

BookEdition C (and . . . 

(the publishedBy PublishingCompany) 

(the isbnNr IsbnNumber) 

(the format (one-of 'printed ’audio))) 

/* Book copies are related to book editions, and in turn have their own roles */ 

BookCopy C (and . . . 

(the callNr CallNumber) 

(the atBranch LibraryBranch)). 

There are several alternative ways of proceeding with the modeling of such 
a UofD. 

Since objects in each of these classes are seen to naturally have attributes 
like hasTitle, it is tempting to think of BookCopy as being a subclass of 
BookEdition so that this attribute is inherited. However, this would mean 
that each individual instance of BookCopy is a separate BookEdition, which 
seems wrong. 

If we are not committed to modeling separate individual instances of each 
of these concepts, it is possible to combine their description into a single 
concept that records all the relevant information. So, for example, we could 
define Books to have all the attributes of the three concepts above, and thus 
really refer to book copies. (But see below.) 

Finally, according to the results in [Pirotte et al, 1994], a more appropriate 
approach is to view each edition of a book as determining a subclass of 
BookCopy. Each of these subclasses can then be viewed as an instance of 
BookEdition, for which it provides so-called “meta-roles.” Materialization is 
the combination of these ideas. 

The materialization relationship can be modeled in Description Logics by 
a role materializationOf, connecting in our case book editions and books, and 
book copies and book editions. However, this sounds very unnatural when 
read out loud, so a better approach may be to create subroles of the general 
role materializationOf. This means that the above model would be completed 
by adding the following assertions 
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/* editionOf is a kind of materialization relationship */ 

editionOf C materializationOf 

/* Book editions are materializations of books */ 

BookEdition C (the editionOf Book) 

/* copyOf is a kind of materialization relationship */ 

copyOf C materializationOf 

/* Book copies are materializations of book editions */ 

BookCopy C (the copyOf BookEdition). 

Often, the properties of the more abstract concept are inherited by the 
materialization. For example, the book edition, and then the book copy, has 
the same title and author as the book. In Description Logics, this relation- 
ships can be expressed by identifying the appropriate attribute values on 
the general and the materialized object: 

BookEdition C (same-as hasTitle (editionOf o hasTitle)). 

Several additional kinds of relationships between attributes of an object 
and its materialization are identified in [Pirotte et al, 1994], but they are 
rather unclear and cannot be represented in Description Logics. Probably the 
most interesting is the case when an attribute of the more general concept 
has no correspondent on materialized individuals. For example, though a 
book edition may reasonably record the date when it was first and last 
printed, it seems very questionable to say that a book copy has a last printing 
date. 

This looks like a case of meta-roles of the kind mentioned earlier. The main 
importance is that if one wants to have in the model attributes such as first- 
Printing, then one cannot “melt” objects (book editions) into their various 
materializations (book copies), and is forced to model them separately. 

10.7.3 Part-whole aggregation 

The part-whole relationship distinguishes roles of a book such as its chap- 
ters, from others such as its publisher. There is a long history of discussions 
concerning this topic, with [Artale et al., 1996b] being an excellent and 
comprehensive survey that considers, among other things, a variety of DL 
solutions to the problem. We present here some interesting observations. 

Cognitive scientists have distinguished a variety of part-whole relation- 
ships, whose mixture has caused apparent paradoxes; according to one 
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hypothesis these can be distinguished by differentiating three kinds of wholes 
- complexes, collections and masses - with parts called components, mem- 
bers and quantities respectively; furthermore parts can be portions (sharing 
intrinsic properties with the whole) and segments. Most physical objects, like 
book copies, are complexes of their parts (e.g., pages), but in the book do- 
main we also find uses for collections in modeling books that are anthologies 
of other literary pieces. 

In addition, one can qualify the nature of two aspects of the relationship 
between parts and wholes: 

• Existence: A whole may depend on particular individual(s) for its continued ex- 
istence and identity, as in the case when the part is irreplaceable (e.g., a book 
must have an author); or it may depend generically on a class of parts (e.g., a 
book copy must have a cover). Conversely, the part may depend on the whole for 
its existence (e.g., the chapter of a book). Finally, a part may belong exclusively 
to only one whole or it might be shared. 

• Properties: Properties may be “inherited” from the whole to the part (e.g., 
owned By) or from the part to the whole (e.g., isDefective). 

At the very least, the above provides a checklist of issues to consider 
whenever a part-whole relationship is encountered during modeling. 

In the realm of Description Logics, Sattler [1995] offers an approach to 
dealing with these topics, exploiting various role-forming operators such as 
role hierarchies, role inverse, and transitive closure to capture the semantics 
of aggregation. 

Specifically, special roles are introduced for the different kinds of part- 
whole relationships mentioned above: hasDComponent, hasDMember, has- 
DSegment, hasDQuantity, hasDStuff, hasDIngredient, where “D” stands for 
“direct”. One then defines more complex relationships from these primi- 
tives: 

hasComponent = (transitive-closure 

(or ro ; e hasDComponent (hasDMember o hasDComponent))) 
hasPart = (or TO ; e hasComponent hasMember . . .) 


indicating that members of collections of components are also components, 
and that hasPart is the union of the various subkinds of relationships. 

Let us concentrate here on the component-of relationship, which is prob- 
ably the one most frequently encountered in practical applications. We shall 
consider the table of contents of a book as an exemplar of a component 
attribute. 
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One idea is to declare attributes and roles that represent components (e.g., 
tableOfContents) as specializations of hasDComponent. This allows us to dis- 
tinguish such component roles from other roles, like lentTo and publisher. 

Obviously, the inverses of such roles provide access from a part to its 
containing whole: 

isDComponentOf = (inverse hasDComponent) 

hasTableOfContents = (inverse contentsOf). 

Turning to “existence” constraints, a book (but not a copy of a book!) 
depends on the existence of its specific table of contents, and conversely. 
Although we can specify that a book must have table of contents, as with 
earlier “dynamic” aspects (such as (im)mutable class membership) standard 
Description Logics are not currently equipped to express constraints stating 
that an attribute value cannot change. 

To model the fact that each table of contents belongs exclusively to one 
book, we can use qualified number restrictions 

TableOfContents E (the contentsOf Book). 

Finally, the inheritance of properties (e.g., isDefective) across component- 
like attributes is modeled using constructs such as same-as, which relate 
attribute/role chains set-theoretically, in the same manner as shown with 
materialization: 

Book C (same-as isDefective (hasTableOfContents o isDefective)). 

Note however that several of these representations require quite expressive 
language constructs, whose combination may result in a language for which 
subsumption is undecidable. 


10.7.4 General constraints 

In many modeling exercises one will encounter general constraints that char- 
acterize valid states of the world. For example, the dueDate of a book must 
be later than the lentO n date. 

Except for a few cases involving identity of attribute paths, these con- 
straints will not be expressible in standard Description Logics, due to their 
limited expressive power. Several widely distributed systems, such as Clas- 
sic and Loom, offer “escape hatches” - concept constructors that allow 
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one to describe sets of individuals using some very powerful language, such 
as a programming language (Classic’s test-concepts) or some variant of 
first-order logic (Loom’s assertions). These concept definitions are usually 
opaque as far as concept-level reasoning is concerned, because the system 
cannot guarantee correctness for such an expressive formalism. However, 
these concepts can have an impact as far as the ABox reasoning is con- 
cerned, since the latter resembles a logical model, and therefore we can do 
relatively simple “evaluation” as a way of recognizing individuals. Thus, in 
Classic, the test-concept (test date-after (dueDate lentOn)) would invoke 
the date-after function on the dueDate and lentOn attributes of an individual 
object, and check that the first is temporally after the second, thus classify- 
ing individuals, or detecting errors in the ABox. 

More general than these procedural extensions are DL systems that are 
extensible in the sense that a “knowledge language engineer” can add new 
concept constructors, and extend the implementation in a principled way. 
For example, if we wanted to deal with dates and durations (clearly a de- 
sirable feature for libraries), we would want to be able to compare dates, 
add durations to dates, etc. General approaches to extending Description 
Logics have been described, among others, in [Baader and Hanschke, 1991b; 
Borgida, 1999; Horrocks and Sattler, 2001]. 


10.7.5 Views and contexts 

Although the initial goal is usually to provide a single model of the UofD, 
it turns out to be very important to preserve the various “views” of the 
information seen by different stake-holders and participants. For example, 
a book that is in the library (and by definition, this would mean that it 
has no value for the lentTo role) is of interest to the staff, for example to 
help find it; for this, it may have a role location, which might specify some 
shelf or sorting area; this attribute may be attached to the MateriallnLibrary 
concept. 

On the other hand, a view of Material called MaterialOnLoan (which re- 
quires a lentTo role value), would be a natural place to keep information 
about dueDate and nrOfRenewals - attributes that would normally appear 
on the relationship itself. This view is of particular interest to the borrower, 
but also the staff in charge of sending overdue notices. 

Incidentally, the above pattern of replacing a binary relationship having 
attributes by two views can be applied any time one of the participants in 
the relationship is restricted to appear in at most one tuple (e.g., every book 
can be loaned to at most one borrower). 



399 


Conceptual Modeling with Description Logics 

10.8 A conceptual modeling methodology 

The world of object-oriented software development has produced a vast lit- 
erature on methodologies (e.g., [Shlaer and Mellor, 1988]) for identifying 
objects, classes, methods, etc., for a particular application. Instead of con- 
sidering this voluminous material here, we will recapitulate some of the issues 
raised above by extending the outline of a simple DL knowledge engineering 
methodology first presented in [Brachman et al, 1991]. The reader is referred 
to that article for more details, including a long worked-out example. 

We present the main steps of modeling, with suggestions for refinements 
to be accomplished in later passes; this is in order to avoid the modeler 
becoming overwhelmed by details: 

• Identify the individuals one can encounter in the UofD. Revisit this later consid- 
ering issues such as materialization and values. 

• Enumerate concepts that group these values. 

• Distinguish independent concepts from relationship-roles. 

• Develop a taxonomy of concepts. Revisit this later considering issues such as 
disjointness and covering for subconcepts. 

• Identify any individuals (usually enumerated values) that are of interest in all 
states of the world in this UofD. 

• Systematically search for part-whole relationships between objects, creating roles 
for them. Later, make them subroles of the categories of roles mentioned in Sec- 
tion 10.7.3. 

• Identify other “properties” of objects, and then general relationships in which 
objects participate. 

• Determine local constraints involving roles such as cardinality limits and value 
restrictions. Elaborate any concepts introduced as value restrictions. 

• Determine more general constraints on relationships, such as those that can be 
modeled by subroles or same-as. (The latter often correspond to “inheritance” 
across some relationship other than IS-A, and have been mentioned in several 
places earlier.) 

• Distinguish essential from incidental properties of concepts, as well as primitive 
from defined concepts. 

• Consider properties of concepts such as rigidity, identifiers, etc., and use the 
techniques of [Guarino and Welty, 2000] to simplify and realign the taxonomy of 
primitive concepts. 


10.9 The ABox: modeling specific states of the world 

So far, we have concentrated on describing the conceptual model at the 
level of concepts. In some applications we may want to use our system to 
keep models of specific states of the world - somewhat like a database. As 
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discussed in Chapter 2, this involves stating for each specific individual zero 
or more fillers for its attributes and roles, and asserting membership in zero 
or more concepts (primitive, but also possibly defined). 

One of the challenging aspects of modeling the state of the world with 
Description Logics is remembering that unlike databases, DL systems do 
not make the closed-world assumption. Thus, in contrast with standard 
databases, if some relationship is not known to hold, it is not assumed to be 
false. 

One consequence of this is that any question about the membership of an 
individual in a concept, or its relationship to another individual, has three 
possible answers: definitely yes, definitely no, or unknown. The positive side 
of this is that it allows the modeling of states with partial information: one 
can model that BOOK22 is an instance of Book, and hence has exactly one 
filler for isbnNr, yet not know what that value is. Chapter 12 shows how 
this feature has been exploited in developing a family of DL applications for 
configuring various devices. 

Another consequence of the above stance is that in some cases individu- 
als are not recognized as satisfying definitions when one might expect them 
to. For example, suppose we only know that hasAuthor relates BOOK22 to 
SHAKESPEARE, who in turn is known to be an instance of Englishman. 
This, by itself, is not enough to classify BOOK22 as an instance of concept 
(all hasAuthor Englishman); we must also know that there are no other pos- 
sible fillers for BOOK22’s hasAuthor role - i.e., that BOOK22 is an instance 
of (at-most 1 hasAuthor) - before we can try to answer definitively whether 
BOOK22 is an instance of (all hasAuthor Englishman). Even in this case, if 
the answer is not “yes” , we may get “no” or “maybe” . 

A final consequence of not making the closed-world assumption is that 
there is a clear distinction between the state of the world (out there) and our 
(system’s) knowledge of it. This is reflected by the terminology used above 
(e.g., “we must also know there are no other possible fillers”). As a result, in 
modeling a domain one may find it necessary to specify concepts that involve 
the state of our knowledge base, rather than the state of the world. For exam- 
ple, we might want to find out exactly which books in the KB are not known 
to have a ISBN number. The description (and Book (at-most 0 isbnNr)) will 
not do the job, because the second constraint would conflict with one of the 
the necessary conditions of Book, which is that it must have have exactly 
one isbnNr. What is happening here is that the at-most 1 constraint con- 
cerns the state of the world, while the at-most 0 condition involves the KB’s 
knowledge of the world. To deal with this, we need some form of epistemic 
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operator, so we can define the concept 

UnknownlsbnBook = (and Book (at-most 0 (known isbnNr))). 

The general problem of adding an epistemic operator to Description Logics 
is considered in [Donini et al, 1998a], but this is not available in currently 
implemented Description Logics. A “hack” would be to introduce for such 
roles a subrole, whose identifier indicates its epistemic nature: 

knownToHaveAuthor C hasAuthor 

and then be sure to assert fillers only about the “known” variant. Unfortu- 
nately, there is no way to tell a Description Logic that such roles automati- 
cally have the “closed- world assumption” . 


10.10 Conclusions 

There are a wide variety of sources that discuss the application of object- 
oriented approaches to modeling a domain. The same principles apply to 
conceptual modeling in general. For this reason, we have concentrated here 
on some of the more subtle ontological issues that arise during modeling, 
and the different ways in which these can be encoded in Description Logics. 
In some cases the issues examined were suggested by features of Description 
Logics themselves. 

In the process, we covered most of the kinds of questions that would have 
to be addressed while modeling something like the library domain, and un- 
covered some of the strengths and also some of the weaknesses of Description 
Logics in representing this conceptual model. The latter include difficulty in 
representing (structured) values, constraints related to the dynamic aspects 
of the domain, certain forms of “inheritance” (e.g., for materialization), and 
meta-information. These were balanced by the multitude of features deal- 
ing with primitive and defined concepts, necessary and sufficient conditions 
for concept specification, and the treatment of roles as first-class citizens in 
subclasses and composition. 

Probably the biggest problem in developing an appropriate conceptual 
model for a domain is that of testing it for correctness and completeness. 
The former is supported by the reasoning and explanation facilities provided 
by Description Logics. The latter, as usual, is much more difficult to achieve. 
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Abstract 

This chapter reviews the application of Description Logics to software engi- 
neering, following a steady evolution of DL-based systems used to support 
the program understanding process for programmers involved in software 
maintenance. 


11.1 Introduction 

One of the first large applications of Description Logics was in the area 
of software engineering. In software, programmers and maintainers of large 
systems are plagued with information overload. These systems are typically 
over a million lines of code, some approach fifty million. The size of the 
workforce dedicated to maintaining these enormous systems is often over 
a thousand. In addition, turnover is quite high, as is the training invest- 
ment required to make someone a productive member of the team. This 
seems, on the surface, to be a problem crying out for a knowledge-based so- 
lution, but understanding precisely how Description Logics can play a role 
requires understanding the basic problems of software engineering “in the 
large” . 


11.2 Background 

The three principal software maintenance tasks are pro-active (testing), re- 
active (debugging), and enhancement. Central to effective performance of 
these tasks is understanding the software. In the 1980s, cognitive studies 
of programmers involved in program understanding [Soloway et al, 1987] 
revealed two things: 
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1. Programmers typically solve problems by realizing “plans” in their programs. 
This seems to tie the notion of program understanding to plan recognition 
[Soloway et al., 1986]. 

2. Delocalized plans (plans which are not implemented in localized regions of code) 
are a serious impediment to plan recognition, for both humans and automated 
methods [Soloway and Letovsky, 1986]. 

While these observations were interesting, the studies from which they 
were derived were slightly flawed from the industrial perspective described 
above: the subjects of these studies were almost exclusively students work- 
ing alone with small domain- independent programs (i.e. , sorting, searching, 
etc.). It was not clear how these results applied to experienced programmers 
working in teams with huge domain-specific programs. 

An ambitious effort launched by AT&T [Brachman et al, 1990] attempted 
to address this problem by studying maintainers of a large software system, 
and measuring the time they spent performing different categories of tasks. 
What they found was a bit startling: up to 60% of the time was spent 
performing simple searches across the entire software system. A part of what 
was termed discovery , and as pointed out later in [Welty, 1997], the need 
for these searches was the result of the delocalization not only of plans in 
software, but of information in general; information a maintainer needs to 
understand a section of code is frequently not found in the vicinity of that 
section of code, but may be before or after in the file, in a different file, 
in a different directory, etc. For a large software system whose source code 
is spread out over a large number of files in a deep and complex directory 
structure, finding something as simple as, e.g., the definition of a datatype, 
with tools such as find (the Unix program that runs another program on 
all files recursively down a directory structure) and grep (the Unix program 
that searches files for strings) was both difficult and time-consuming. 

Another more comprehensive study was performed by MCC around the 
same time [Curtis et al, 1988], which concluded, among other things, that 
a prerequisite to understanding the software is understanding the domain 
in which the software operates and is a part - if you don’t know what a 
“dial tone” is, you can’t be expected to debug the code that generates a dial 
tone. 


11.3 LaSSiE 

In an attempt to have a direct impact on the maintenance group, the re- 
searchers at AT&T developed the notion of a Software Information System 



404 


C. A. Welty 


LASSIE-THING 



H-FILE 


Fig. 11.1. The LaSSiE code-level ontology. 


THING 



Fig. 11.2. The LaSSiE telephony ontology. 


(SIS) [Brachman et al., 1990]. An SIS is basically an information system 
which treats the software system source code itself as data, and stores re- 
lationships that can provide the information maintainers frequently search 
for during discovery. 

The first SIS, LaSSiE [Devanbu et al., 1991], was developed to assist the 
understanding of AT&T’s Definity 75/85 software system. Influenced by 
their own study and that of MCC, it contained two components: a domain 
model and a code model. 

The code model was implemented with a simple ontology of source code 
elements, shown in Figure 11.1, which was derived empirically from the basic 
kinds of searches maintainers performed. The knowledge base (the actual 
assertions about individual functions, files, datatypes, etc.) was populated 
automatically from the source code. 
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The domain model was reverse engineered from the code and contact with 
the domain experts, and contained knowledge about the telephony domain, 

i.e. the things the software system dealt with. These included entities such as 
telephones, microphones, cables, cable-trunks, etc. A sample of the ontology 
is shown in Figure 11.2. 

One of the most interesting aspects of this work, and perhaps the most sig- 
nificant from the perspective of exploring Description Logics, is an analysis 
of the differences between these two models. The code model was founded 
on a very simple ontology, containing perhaps twenty concepts, and was 
populated with a large number of individuals, on the order of thousands (at 
least one for each file, datatype, function, and variable in the system). The 
domain model had a large and complex ontology, containing perhaps two 
hundred concepts, but very few individuals. 

The reason for this difference was that the trivial searches that character- 
ized software discovery were performed for two reasons: 

1. Discovering specific information about the software, e.g., what is the datatype 
of the variable dial-tone? 

2. Discovering specific information about the domain, e.g., what is a dial tone? 

In case 1, the maintainer requires specific information about the software, 
and thus raw data that represents that information is required. For example, 
by far the most common question asked during discovery is, “Where is this 
variable used?” [Welty, 1997]. Normally, a maintainer would grep for the 
variable in the rest of the code to find the answers to this question, and as if 
this didn’t take enough time and effort, the results would have to be pruned 
by hand to remove various kinds of “semantic noise” such as: 

1. variables with longer names that include the desired variable name 

2. names of functions that include the desired variable name 

3. comments that include the variable name 

4. other non-variable string matches. 

In this particular case, the amount of semantic noise was quite high as a 
result of mandated naming conventions whose intent was to make the source 
code easier to understand (semantic noise, also known as false positives , is 
a general problem with string-based search methods, and will be discussed 
further in Chapter 14). 

The SIS code model immediately solved these problems by identifying 
“variable” as a semantic category (as well as file, function, etc. See Fig- 
ure 11.1). This meant, quite simply, that where a string search for places in 
which e.g., the variable error-value was used might yield such unwanted 
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results as: compute-error-value, display-error-value-result-code, 
error-value-lookup-table, etc., limiting the search to variables would 
remove up to 80% of the noise. 

In addition to trivially being able to restrict searches to specific categories, 
other information that could be extracted automatically was mined from the 
code. For variables, it is simple to automatically determine: 

• the file it was defined in 

• each function in which the variable was used. 

In addition, for each function, information was extracted regarding the 
file it was defined in. From this, a simple inference could be made as follows: 

Variable = VusedlnFile. File n VusedlnFunction. Function n 
usedlnFile = usedlnFunction o definedlnFile. 

In other words, if a function uses a variable then the variable is used in 
the file that function is defined in; this produces all the locations where the 
variable is used. In this manner, the LaSSlE system augmented the basic 
data in a number of ways through inference. 

A similar and nearly as common maintenance task was, e.g., after modify- 
ing a function, searching for all the places that function is used to see if the 
changes affect other sections of the code. Information about what functions 
call others (i.e., the call graph) was also kept in the code model, and an 
expression similar to the one in the example above can be used to derive all 
the files in which a function is called. 

The code model alone was able to simplify several of the common discovery 
tasks maintainers experienced during code modification, but as suggested 
in case 2 of the reasons for engaging in discovery listed above, there are 
other reasons for a maintainer to be searching through the code. For these 
cases, in which domain information is the desired result of a search, a robust 
description of the domain is required, and was provided by the domain model 
(see Figure 11.2). 

For example, a maintainer may want to know what kinds of actions a 
user of the system can take by themselves. To answer this question from 
the code - the usual approach before LaSSlE - would be quite difficult. One 
method might be to grep through the code for the string “user” - hoping of 
course that the documentation is up to date or consistent with respect to 
user actions. Clearly the semantic noise would be quite high in such a case. 

Another approach might be to start with a piece of code the maintainer 
is familiar with, and draw some clues from that for where to look next. 
The point here is that, whereas for code- model queries the goal is quite 
specific, domain-oriented queries are not, and imply a lot of time browsing, 
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searching for new ideas, etc. The code is organized around specific functions, 
not around specific domain concepts, and of course multiple “views” of the 
code are not supported. 

To address this type of need, the LaSSiE domain model expressed knowl- 
edge about the domain of telephony. It presented numerous key concepts 
that let maintainers view the knowledge in the code in a variety of different 
ways. The domain model was mostly terminological, since it was a descrip- 
tion of the things that the software could do. An action concept, such as 
“generating a dial tone” was a description of the action, whereas an individ- 
ual would be an actual action of generating a dial tone at some fixed time. 
These individuals did not normally exist in the domain model, except as 
examples. The concept would roughly be 

GenerateDialToneAction = Action n VinitiatedBy.LatBox n 

Vfollows.OffHookAction n 
Vrecipient.LocalPhone n ^ 1 hasConnection. 

In other words, a “generate dial tone action” is an action that is initiated 
by a local telephone service following an “off hook action.” The recipient of 
the product of the action (the dial tone) is a phone for which a connection 
has been allocated. 

Other domain concepts described things that the software reacted to, such 
as 

OffHookAction = Action n VinitiatedBy. User n ^ 0 follows n 

Vrecipient.LatBox n Vactivates.AllocateConnectionAction. 

In other words, an “off hook action” is an action that is initiated by a user 
(more commonly the result of pressing a button these days than lifting the 
receiver off the hook). It follows no previous action, and the recipient of the 
product of the action is the local telephone service (on which the software 
is running). The action activates a search for a connection. 

Returning to the randomly chosen example above, the maintainer looking 
for all actions that can be initiated by a user would simply enter a query 
such as 


Action n VinitiatedBy.User 

and the system would find all the concepts subsumed by that expression. 
LaSSiE contained a facility for defining new domain concepts identified by 
maintainers during discovery, and adding them to the domain model by 
simply assigning them a name (e.g., USER- ACTION in this example). 

While these two models independently solved existing problems, it soon 
became clear that integrating the two models was an important requirement. 
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Using the tool exposed the fact that most domain queries were followed by 
code queries. For example, after exploring the domain model to discover the 
significance of a “connect action” , the maintainer will typically ask, “What 
are the functions that implement it?” In addition, classifying software com- 
ponents by their relevance in the domain was viewed to be a very significant 
bit of functionality, as this permitted components to be found and retrieved 
with this information - something that was not previously possible. 

This integration between the two models made it possible to use sub- 
sumption to find different software objects. For example, all functions that 
implement connect actions would be 

Function n Connect Action. 

Variables used in functions that implement user actions would be 

Variable n VusedlnFunction.UserAction. 

The LaSSlE system underwent steady development for several years at 
AT&T, and was shown to cut down on the time maintainers spent in dis- 
covery. In order to further improve the process, it was observed that: 

• The connection between the domain and code models needed to be made by hand. 
This was time-consuming to create, and difficult to maintain since the domain 
model changed over time as new features were added to the software system. 
Maintainers began to lose faith in the domain model, as a result, and usage 
deteriorated. 

• The code model, though extremely simple, was used far more frequently than the 
domain model, and became an important part of every maintainer’s tool set. It 
did not, however eliminate the searches maintainers made, and therefore did not 
completely replace find and grep. 


11.4 CodeBase 

Because the code model proved quite useful and easy to maintain, the de- 
mand for it began to increase. This introduced two problems for the LaSSlE 
SIS: 

• Like all DL-systems, it was based on main memory. The software contained many 
thousands of functions, variables, and files. More importantly, the complexity of 
the function call graph, variable usage graph, and location maps, exceeded one 
million. It was not possible to store this amount of information in main memory 
of any computer at that time. 

• The natural language interface, while simple and easy to understand, did not 
facilitate using the system quickly. One still had to compose a proper query and 
type it in. If the result of one query were to be used in another, the maintainer had 
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to re-type the name(s) of the concepts or individuals involved. Increased usage 
demanded a better user interface. 

The CodeBase system [Selfridge and Heineman, 1994] offered solutions 
to both of these problems. Perhaps the most significant achievement was the 
development of a system for off-line storage of individuals. The relatively 
small code-model TBox was always kept in memory, but individuals were 
kept on a disk, in a technique similar to virtual memory. The difference was 
in the heuristics used for predicting what portions of the ABox to pre-load. 

Whereas a virtual memory system normally uses heuristics based on tem- 
poral or spatial proximity, for a knowledge base like LaSSiE, this was not 
relevant. The location of an individual in physical memory was no indication 
of its relevance to other individuals near it in physical memory. 

The heuristics for virtual memory are based on the empirical observation 
that when one location is accessed, it is probable that the next access will be 
to a nearby location in memory. The LaSSiE developers observed that, in a 
DL ABox, when an individual is accessed it is probable that the next access 
will be to one of its role fillers, or to objects along some role path from the 
accessed individual. Because a role may have many fillers, and because an 
individual may have many roles, there is no way to arrange the individuals 
in memory so that the normal virtual memory heuristics will be efficient. 

CodeBase also provided numerous graphical tools for viewing and brows- 
ing the information in the knowledge base. While this is less significant from 
the general perspective of Description Logics, it is important from the stand- 
point of developing knowledge-based systems. One must never forget that 
these systems interact with people, and can not be considered as viable 
systems unless the human is “in the loop” . 


11.5 CSIS and CBMS 

Development of LaSSiE was eventually halted by the trivestiture of AT&T 
in 1995. Research into software information systems did not stop, however, 
and Description Logics have played an important role in this continued de- 
velopment. 

Two issues were brought to light by the LaSSiE system: 

• The deterioration of the domain model over time was another manifestation of 
the classic software documentation problem: the same information being stored 
in different ways. The code model stayed relevant because it was automati- 
cally generated from the only thing that had to be maintained: the software. 
It did not, therefore, need to be maintained separately to remain accurate. The 
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documentation and the domain model were different representations of the knowl- 
edge that was, perhaps implicitly, in the code. These representations always lagged 
the “real” one, since they had to be maintained independently. 

• The delocalization of information in software, which is the central obstacle to 
code understanding, required new ways of viewing the code. Looking at code on 
the screen, analogously to the heuristics for operating system virtual memories, 
is inherently two-dimensional. It does not allow relationships between code-level 
entities to be viewed, or localized. 

The first step in determining how to address these problems was to per- 
form further studies of programmers involved in discovery to gain more de- 
tailed insight into specifically what they were doing. One such study, in this 
case of programmers maintaining a moderate-sized object-oriented software 
system, found that the most common high level queries were: 

1. Where is this variable modified? 

2. What are the available slots and methods on this instance? 

3. What is the datatype of this variable or function? 

4. What are the superclasses of this class? 

5. What does this function return? 

6. Does this function have side-effects? 

7. Is this datatype used? 

Clearly, to provide answers to questions like these requires far more fine- 
grained information about the software than simply the locations of the def- 
initions. Furthermore, this study confirmed that object-oriented languages 
actually increase understanding problems by delocalizing much more infor- 
mation than their imperative predecessors [Huitt and Wilde, 1992], Inheri- 
tance, in particular, spreads method and slot (instance variable) declarations 
up the class hierarchy, making it harder to find answers to questions about 
class composition, among other things. 

These issues spurred research into Comprehensive Software Information 
Systems [Welty, 1995], which soon became Code-Based, Management Sys- 
tems. The idea of CBMS was to define the most precise level of granularity 
of representation needed to have complete knowledge of the software sys- 
tem in the knowledge base. In other words, to have the knowledge-based 
representation be the artifact that is maintained. 

From a DL perspective, such a comprehensive representation of software 
in a knowledge base required the ability to deal with large amounts of in- 
formation efficiently. In addition, such a deep representation made it pos- 
sible for a wide range of inferences that were well-suited for subsumption 
reasoning. 
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SOFTWARE-THING 



SOFTWARE-OBJECT SOFTWARE-VALUE SOFTWARE-ACTION 



Fig. 11.3. A simplified code-level ontology. 


A CBMS is based on a full-scale parse of the code to construct an abstract 
syntax tree (AST), which is basically the parse tree. The AST has all the 
information of the source code, such that the source code can be completely 
generated from the AST. The AST is augmented with semantic information 
that can be derived automatically from the syntax. In C+- 1-, for example, 
we know that the left side of an assignment operator is the variable to be 
changed, and the right side is the new value. 

The ability to represent everything in the code requires a deeper ontol- 
ogy of code-level software elements than the original LaSSlE ontology, that 
includes statements, blocks, conditions, etc. In fact, every syntactic element 
of the programming language is in the ontology. A simplified ontology for 
an object-oriented language is shown in Figure 11.3. 

In addition to these concepts representing the syntactic elements of the 
source language, roles were use to relate instances of these concepts to each 
other for control flow, data flow, call graphs, etc. For example, take the 
following C++ code fragment: 

void group_deliver ( 

MAIL_MESSAGE message, 

GROUP group) 
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Fig. 11.4. A CBMS representation of the code fragment. 


{ LIST members; 

members = get _members (group) ; 
while (! empty (members) ) { 

ind_deliver (message , car (members ) ) ; 
members = cdr (members) ; 

} 

> 

The CBMS representation of that fragment is shown in Figure 11.4. Note 
that Figure 11.4 shows only the ABox corresponding to the small code frag- 
ment, and that role fillers are shown as binary relations. 

With an interface that showed individuals in the code representation with 
role fillers displayed as hypertext links (see [Welty, 1996a]), this ontology 
alone localizes far more information than the standard text view of software 
displayed in an editor window. Again, an editor window localizes only the 
control flow information; a maintainer looking, e.g., at the code fragment 
shown above, only sees the text. The lines are arranged in roughly control- 
flow order. 

Using a CBMS representation, a maintainer’s view is focused on a particu- 
lar object, such as the assignment statement on the first line of the function. 
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This view would be: 

ASSIGNMENT-STATEMENT-23 : 

implementation-of : {FUNCTION-03 : group_deliver} 

next: {WHILE-STATEMENT-14} 

changes: {LOCAL-VARIABLE-16: members} 

new-value: {FUNCTION-INVOCATION-34: get-members (group) } 

In this kind of view, anything in {. . . } is a hypertext link to a similar 
description of the individual named in the link, and localization takes on a 
new meaning: the number of hypertext links a desired piece of information 
is from the current context (individual being viewed). For example, infor- 
mation about control flow is accessible through a chain of next links, but in 
addition, information about data flow is accessible through the new-value 
link, about the function being implemented, about the variable being used, 
etc. 

Another advantage of the CBMS approach is that reasoning can be em- 
ployed to augment the data and automate the localization of even more 
information. In existing work in Classic, three types of reasoning were 
employed: 

Role inverses. Every role in the ontology has an inverse, and this pro- 
vides a tremendous amount of simple bookkeeping information use- 
ful to maintainers. In the example above, the changes role is filled 
through parsing with members, and the inverse relationship, that the 
variable members is changed By ASSIGNMENT-STATEMENT-23, is added as 
well. The power of this simple inference can not be overstated. Studies 
showed that this was the most useful kind of information the system 
provided, as it answered the most common question asked by maintain- 
ers. 

Path tracing. Many useful pieces of information were a few clicks away, 
but would be more useful if brought within one click (i.e. , one link). 
A simple set of forward chaining “filler” rules in Classic are capable 
of handling this. For example, it is also useful to know within which 
functions a variable is changed. Without inference, the maintainer must 
click on the changed By role for a variable to get to the statement that 
changes it (or statements), and then must click on the implementationOf 
role for the statement to get to the function. Instead, with “path- 
tracing rules”, we can fill the changedlnFunction role automatically 
with all the values from the path (changedBy implementationOf). Thus 
in our example we can conclude that members is changedlnFunction 
group_deliver. 
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SOFTWARE-THING 


SIDE-EFFECT-THING 



OUTPUT-SIDE-EFFECT 

Fig. 11.5. The side-effect ontology. 

Subsumption. With subsumption reasoning, membership in a number of 
useful classes can be inferred for individuals representing pieces of the 
code. For example, the concept GlobalAssignmentStatement is defined: 

GlobalAssignmentStatement = AssignmentStatement n 

Vchanges.GlobalVariable 

which allows all the assignment statements that modify global variables 
to be identified. 

The most compelling result that came out of the CBMS work so far has 
been the automatic detection of side-effects, answering the sixth most com- 
monly asked question. This detection was not originally believed to be pos- 
sible. To simplify the discussion, we assume a pure object-oriented language 
without pointers or call- by-reference parameters. The latter can be handled 
in a similar way, the former is still believed to be undecidable. 

There can be two kinds of direct side-effects in a method: a change to 
a global variable, and any sort of output. A third kind of side-effect is a 
call to a method that has a side-effect. In this case, the side-effect does not 
actually occur within the calling method, yet a side-effect will occur when 
the calling method itself is invoked, so it can be important to discover it. 
A change to a global variable occurs whenever that variable appears in an 
assignment statement as the variable to be changed. 

The CBMS ontology contains a fairly simple extension which can au- 
tomatically detect side-effects on global variables and calls to methods 
with side-effects. Output methods must be specifically identified as such 
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in order that calls to them may be recognized. This is not really a prob- 
lem, since output functions are generally part of a support library which 
would be provided to any developer. The extension begins with a new 
part of the code-level ontology, shown in Figure 11.5. This new ontology 
of primitive concepts fits under the SoftwareThing concept. Next, any in- 
dividual of GlobalAssignmentStatement (defined above) is a side-effect - an 
AssignmentSideEffect. 

In order to put individuals of AssignmentSideEffect into the side-effect 
ontology shown in Figure 11.5, a forward chaining rule is added: 

AssignmentSideEffect =>• DirectSideEffect. 

This rule is required because if the relationship it specifies were part of 
the defined concept, being a direct side-effect would become a sufficient 
condition for recognizing assignment side-effects, and they would never be 
found automatically. In other words, the rule says “once an assignment side- 
effect is recognized, it should be also be classified as a direct side-effect”, 
whereas putting direct side-effect after assignment in the defined concept 
definition would say, “An assignment side-effect must already be known to 
be a direct side-effect to be recognized.” The latter is not productive. 

At this point we can classify all assignments that change global variables 
as assignment side-effects and direct side-effects. The next addition is a set 
of roles that will help identify the methods that contain these side-effects: 
hasDirectSideEffect, its inverse directSideEffectOf, and their role parents 
hasSideEffect and SideEffectOf. With these roles defined, a path-tracing rule 
is added for DirectSideEffect that says directSideEffectOf = implementationOf. 
In other words, the directSideEffectOf role should be filled with the value in 
the implementationOf role of the assignment. Through the role hierarchy, 
this also adds the SideEffectOf role, and through the inverse, the individual 
of Method that fills this role gets the hasDirectSideEffect and hasSideEffect 
roles pointing back to the assignment. 

With these inverse roles filled in, we can create a new defined concept to 
recognize methods with side-effects: 

MethodWithSideEffects = Method n ^ 1 hasSideEffects 

and a more specific one for methods with direct side-effects: 

MethodWithDirectSideEffects = Method n ^ 1 hasDirectSide Effects. 

Note that the second concept will automatically be classified under the first. 
Now, as a result of the rules that added the hasSideEffect links, every method 
that has in its implementation a slot assignment side-effect will have at least 
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one filler in its hasDirectSideEffects role, and will be classified as a method 
with direct side-effects. 

The next case is detecting indirect side-effects, which first requires rec- 
ognizing invocations of methods that have side-effects (in object-oriented 
terms, a method invocation is a message): 

MessageSideEffect = Message n VcallMethod.MethodWithSideEffects. 

Individuals of this new concept can be recognized since all methods with 
side- effects have been found with the previous two defined concepts. A simple 
forward chaining rule then links these message side-effects back into the 
side-effect ontology: 

MessageSideEffect =>• IndirectSideEffect. 

Next we define two more roles: hasIndirectSideEffect and its in- 

verse indirectSideEffectOf, and make them children of hasSideEffect and 
SideEffectOf, respectively. Once these roles have been defined, and the mes- 
sage side-effects have been found, we can identify all the methods that have 
them in a similar manner to assignment side-effects. First, create a path- 
tracing rule for IndirectSideEffect: indirectSideEffectOf = implementationOf 
which will fill in roles. Now we identify all these methods with indirect 
side-effects with the concept 

MethodWithlndirectSideEffects = Method n ^ 1 hasIndirectSideEffects 

The final step is simply to link methods with side-effects into the side-effect 
ontology with one last forward chaining rule: 

MethodWithSideEffects => SideEffectThing 

The addition of this rule basically creates the side-effect ontology shown in 
Figure 11.5. 

Not only do these definitions identify functions with side-effects, but they 
also lead a maintainer directly to the side-effect itself. The point here, from 
a software understanding perspective, is that subsumption makes it possible 
to localize information that otherwise would be difficult (or at least time- 
consuming) to discover. 

The inferences for finding side-effects are clearly very deep, yet the devel- 
oper or maintainer need not be aware of them. All these side-effect inferences 
come with no extra work by the developer or maintainer at all. In fact, an- 
swers to all of the top questions asked by maintainers during discovery can 
be localized to within one link, therefore one mouse click in the simple hy- 
pertext interface described. 
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Abstract 

Description Logics are used to solve a wide variety of problems, with con- 
figuration applications being some of the largest and longest-lived. There is 
concrete, commercial evidence that shows that DL-based configurators have 
been successfully fielded for over a decade. Additionally, it appears that 
configuration applications have a number of characteristics that make them 
well-suited to DL-based solutions. This chapter will introduce the prob- 
lem of configuration, describe some requirements of configuration applica- 
tions that make them candidates for DL-based solutions, show examples of 
these requirements in a configuration example, and introduce the largest 
and longest-lived family of DL-based configurators. 


12.1 Introduction 

In order to solve a configuration problem, a configurator (human or machine) 
must find a set of components that fit together to solve the problem speci- 
fication. Typically, that means the answer will be a parts list that contains 
a set of components that work together and that the system comprising the 
components meets the specification. This task can be relatively simple, such 
as choosing stereo components in order to create a home stereo system. The 
problem can also be extremely complex, such as choosing the thousands of 
components that must work together in order to build complicated telecom- 
munications equipment such as cross-connect devices or switches. 

One important factor that makes configuration challenging is that making 
a choice for one component typically generates constraints on other compo- 
nents as well. For example, a customer who chooses a receiver that only 
supports up to four speakers may not conveniently support a surround 
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sound system with a subwoofer (since this would require more than four 
speakers). 

Configuration continues to have strong interest in the academic and com- 
mercial communities. It has been a prominent area in artificial intelligence 
at least since the Rl/XCON [McDermott, 1982] work on configuring com- 
puter systems. Since then, many configuration systems have been built in 
domains including communication networks, trucks, cars, operating systems, 
buildings, furniture layout, and even wine properties to match a meal de- 
scription. Today, there are active mailing lists, workshops and conferences 
(such as the configuration workshops at ECAI-2002 [ECAI, 2002] IJCAI 2001 
[Soininen et al. , 2001], AAAI’99 [Faltings et al. , 1999], and the Fall Sym- 
posium Workshop on Configuration [Faltings and Freuder, 1996]), special 
issues of journals (such as IEEE Intelligent Systems [Faltings and Freuder, 
1998] and Artificial Intelligence for Engineering Design, Analysis and Man- 
ufacturing [Darr et al., 1998]), and research groups at a number of uni- 
versities and companies. Approaches include constraints, expert systems, 
model-based reasoning, and case-based reasoning as well as Description 
Logics. 

Configuration is an important and growing commercial concern. There are 
a number of companies dedicated to configuration such as Trilogy, Calico, 
etc. Other companies in broader markets such as the enterprise integration 
software companies, Baan and SAP, have a major emphasis in configuration. 
Companies that sell complicated products, such as computers, are providing 
their own configurators (e.g., the Dell personal computer online configura- 
tors). There are spinoff companies of general configuration companies that 
are aiming at particular domain areas, such as PCOrder (a spinoff of Trilogy 
focusing on personal computer configuration) . There are also some domain- 
oriented companies that include configuration as a major component such 
as CarsDirect’s configuration of United States consumer car orders. 

Although the commercial configuration market may appear to be a recent 
event since it has been exploding recently, it does have at least a decade of 
history. Trilogy, for example, one of the earlier companies focusing primarily 
on configuration, was founded in 1989. Forrester Research reports that the 
configuration market was valued at 8 billion dollars in 1997 and it predicts 
that the market will grow to 327 billion in 2002. Configuration is also seen 
as important by companies not originally classifying themselves as “con- 
figuration companies”. In a study of fifty eCommerce executives from top 
firms in the business-to-business and business-to-consumer space, Forrester 
Research found that search and configurators were considered the two tools 
most critical for customer support [Koetzle et al., 2001]. 
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The DL community has been addressing configuration needs for over a 
decade as well. Owsnicki-Klewe [1988] presented a view of configuration 
as a consistency maintenance task for Description Logics and AT&T inde- 
pendently began work in 1988 on its family of configurators for telecom- 
munications equipment [Wright et al, 1993; McGuinness et al., 1995; 
McGuinness and Wright, 1998b; 1998a]. Similarly Ford Motor Company has 
had a DL-based configurator [Rychtyckyj, 1996] in the field for over 10 years. 
Others in the DL area have explored Description Logics for configuration as 
well, e.g., [Buchheit et al., 1994c; Kessel et al., 1995]. 


12.2 Configuration description and requirements 

In this chapter, we will be considering large-scale configuration problems. If 
one only has a small number of constraints to satisfy and a small num- 
ber of possible component choices, then any somewhat reasonable solu- 
tion will work. If however the final product is complicated and there are 
thousands of choices and constraints, then there is more need for a well- 
suited solution. We will consider the generic configuration problem where 
there is a complex artifact being assembled from components. Potentially 
the components have subcomponents, and thus the artifact may be mod- 
ular or hierarchical in nature. Also, each of the components typically has 
a number of properties, such as power restrictions, connections to other 
components, etc., and thus components may be tightly interconnected. If 
one looks at modern configuration descriptions [Fleischanderl et al., 1998; 
Juengst and Heinrich, 1998], one can see only large, interconnected, tightly 
constrained, complex systems. 

The input description for the configuration problems we will consider will 
be a specification of a complex, probably highly interconnected system. The 
input should be able to be entered incrementally by a user as well as being 
able to be uploaded from sales programs. The input specification may be: 

• incomplete 

• ambiguous 

• incrementally evolving 

• granular to different levels of specificity 

• inconsistent 

• entered in any arbitrary order 

• interconnected 

• nested with complex structure. 

The output for the system, in its simplest form, will be some kind of parts 
list. The parts list may be organized hierarchically so that there is a parts 
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list of high level components (such as bays in switching systems or speaker 
sets in home theater systems) as well as a detailed parts list of the individual 
components. In this chapter, we will only address configuration and not the 
related area of parts layout. 

The output of the system should be: 

• correct 

• complete 

• consistent (with respect to other parts, preferences, pre-existing components in 
the customer’s environment) 

• modifiable 

• understandable / explainable 

• capable of being queried 

• interconnected and interoperable with related data. 

The configurator needs to accept the problem input along with any 
previously entered domain information concerning valid configurations. It 
must then check the constraints it has (calculating the constraints that are 
implicit in the input data from the input and background information) in 
order to start building a parts list. It may find that a complete and correct 
parts list may not be built from the given input. In actuality, it is common 
for the problem specification to be either overconstrained (i.e., contain a 
contradiction such as “I want a pair of speakers that is of the highest quality 
available yet I do not want to pay more than fifty dollars for them”) or 
underconstrained (i.e., “I want to buy a high quality stereo system”). In the 
first case, the configurator needs to identify the source of the conflicting in- 
formation and determine (probably along with user input) which conflicting 
constraint (s) to relax. In the second case, the configurator needs either to so- 
licit more specific information from the user, or to generate a list of possible 
configurations, or both. If the configurator makes arbitrary choices for the 
user (e.g., it chooses some receiver for the stereo system yet there were many 
possible choices), then it needs to make it possible for the user to change 
the arbitrary choices and also to find out which choices were arbitrary and 
which choices were mandated by constraints. Additionally it needs to let 
the user enter partial additional input that would further constrain the 
choices. 

The configurator also needs to accept information from multiple data 
sources. There will be a number of databases with which a configurator 
may need to interact. Typically, there will be databases of parts and prices, 
other databases of parts and availability, and possibly many other databases 
with user information or just information about different product families. 
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It is likely that information (such as pricing and availability) will change 
frequently. Also, there will be information concerning what parts are com- 
patible together and how the choice of one part constrains the choices of 
other parts. These might be considered the configuration rules. Although 
these rules might not change on a frequent basis, modifications are typically 
necessary. The rules may come from multiple sources as well. They may 
need to be imported from many different source languages and they may 
need to be input by people who have no training in computer science, let 
alone knowledge representation systems. 

Finally, the system may be long-lived and thus require support and main- 
tenance. It may be necessary to staff a help desk to help users of the system. 
The customer service representatives may know very little about any one 
individual product about which they are answering questions (because they 
are supporting a large number of products). The technical staff maintain- 
ing the individual configurator may not include people who originally built 
the system, and over time, it may not even include people who know much 
about the product (although they may be quite capable of researching the 
product if necessary). Also, the technical staff may need to generate new 
configurators for updated or similar products. 

We might summarize the requirements from the input, output, and core 
configurator requirements starting from the requirements presented in one 
configurator family of applications [McGuinness and Wright, 1998b] and 
augmenting them slightly here. A solution methodology should have the 
following properties: 

• object-oriented modeling 

• rule representation, organization, and triggering 

• active inference and knowledge completion 

• explanation, product training, and help desk support 

• ability to handle incrementally evolving specifications 

• extensible schemas 

• reasoning mechanisms that handle incomplete or ambiguous information 

• inconsistency detection, error handling, and retraction 

• modularity 

• maintainability. 

This list of needs represents those in many complicated reasoning tasks. 
Although we could argue that this general architecture and approach is 
more broadly applicable, we will limit our discussion to configuration appli- 
cations. In the next set of subsections, we will describe each of these needs 
with respect to the task of configuring a stereo system (based on the config- 
urator demo by AT&T [McGuinness et ai, 1995; 1998] and mention how the 
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DL-based solution met the need. When useful or necessary, we will mention 
how the need was addressed in the larger Prose configurator family. 

In the stereo configuration application, the goal was to require the user 
to enter a small number of constraints concerning the end system and gen- 
erate a complete, correct, and consistent parts list. Although the system 
would accept a large set of constraints as input as well, the goal was to 
reduce the user’s task and thus require minimal input. The system used 
the user input along with its extensive domain knowledge and parts in- 
formation to determine if the user’s input specification was consistent. It 
used the underlying theorem prover within the DL system to compute the 
deductive closure of the input and generated a more complete input de- 
scription. User input was solicited on the system quality (high, medium, 
or low with associated price ranges) and the typical use (audio only, home 
theater only, or combination), and then the application deduced applica- 
ble consequences. This typically generated descriptions for 6-20 subcom- 
ponents which restrict properties such as price range, television diagonal, 
power rating, etc. A user might then inspect any of the individual compo- 
nents possibly adding further requirements to it which may, in turn, cause 
further constraints to appear on other components of the system. Also, a 
user may ask the system to “complete” the configuration task (even if the 
user specification was incomplete), completely specifying each component 
so that a parts list is generated and an order may be completed. An online 
demonstration of the web configurator application is available at Stanford 
(http://www.ksl.stanford.edu/people/dlm/dls/stereo-demo/) and a 
number of examples are available in the extended online version of the IJ- 
CAI paper [McGuinness et al., 1995] available at: http ://www. research, 
att . com/sw/tools/classic/tm/ij cai-95-with-scenario.html . 

This application is convenient for illustrating our points since it is small 
and in a broadly understandable domain. It is potentially more interest- 
ing than some simple pedagogical examples since it was developed as an 
application that had representation and reasoning requirements that were 
isomorphic to the needs observed in the Prose family [Wright et al., 1993; 
McGuinness and Wright, 1998b] of configurators. The examples in this paper 
can be seen in more detail in [McGuinness et al., 1995; 1998]. 


12.2.1 Object-oriented modeling 

A system that is being configured may be viewed as a structured object com- 
posed of smaller objects. Even our simple example domain of stereo equip- 
ment presents a natural hierarchy of concept descriptions and instances that 
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have a number of properties. We have a top level node like ElectricalThing and 
then have subclasses of that node such as HomeTheaterSystem and Stereo- 
OrVideoComponent. Further, subclasses of StereoOrVideoEquipment might 
include Receiver, Speaker, and Television. Any particular term may have prop- 
erties associated with it. For example, a Television might have a property 
called diagonal (that must be filled with a positive integer), another called 
price (that must be filled with a monetary value), a repairHistory (that must 
be filled with one of the following values: {BAD, OK, GOOD}), a manufac- 
turer (that must be filled with a company), and a height, width, and depth 
(all of which must be filled with a positive number). All of the properties 
might have cardinality requirements on them. For example, there must be 
at least one manufacturer (although possibly more than one manufacturer), 
there must be exactly one filler for the diagonal role, etc. 

In the simple examples so far, we have seen a need for number (cardi- 
nality) restrictions, value restrictions (choosing the type of a filler for a 
role), roles, and class hierarchies. Further we should note in the descrip- 
tion that the objects are compositional. The value restriction on the man- 
ufacturer role is naturally determined to be a company. Companies them- 
selves might have further properties like headquarter locations, CEOs, etc. 
A user might subsequently want to choose speakers made by companies 
in the United States and televisions made by companies headquartered in 
Japan. 

It is argued more extensively elsewhere [McGuinness and Wright, 1998a] 
and in this book in Chapter 10 that Description Logics are convenient mod- 
eling tools for such objects. We can show a simple example of this dia- 
grammatically where a HomeTheaterSystem inherits a price role with a value 
restriction of MonetaryUnit. We might also have a particular HomeTheater- 
System named MY-HTS that is the system we will be building through the 
example. It will also have a price role with some unknown value at the 
moment. We might also have a subclass of HomeTheaterSystem called High- 
QualSystem. In our simple example, this might be defined simply as a home 
theater system that costs at least 6000 dollars. In a Description Logic sys- 
tem, once MY-HTS contains either a price that is over 6000, or contains 
a partial description such as “a minimum price of 8000 dollars” that re- 
stricts the price to be greater than 6000, then it can be recognized to be 
an instance of a HighQualSystem. This kind of automatic recognition and 
organization of terms based on their definitions is a convenience for orga- 
nizing and maintaining partial descriptions and is arguably one reason that 
Description Logics are thought to be particularly useful for modeling and 
maintenance of applications that require object-oriented models. 
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12.2.2 Rule representation 

A knowledge base that contains information about active deductions will 
contain some sort of rules. Typical large configuration systems will contain 
many rules. Also, these rules may change frequently. It is reported that 40% 
of the rules in R1 changed yearly. Thus, support for modeling, organizing, 
and later, maintaining the rules will be important in large configuration 
systems. A simple rule may take the form of “If something is an A, then it 
is a B" . For example, if something is a HighQualSystem, then its television 
is a HighQualTelevision (which has a minimum price and diagonal value), its 
speakers are HighQualSpeakers (which have minimum price restrictions), etc. 
In fact, in our stereo demo, there are dozens of rules that fire once a system 
is determined to be a HighQualSystem. If the minimum price restriction 
were ever removed from the specification requirement, we would want the 
results of those rules retracted automatically (unless the same results could 
be deduced in other ways as well). 

A DL-based system can support modeling of rules described above in a hi- 
erarchical fashion. Rules can be associated at what ever level of the hierarchy 
is appropriate. Thus, we might associate minimum price and diagonal for 
televisions at the level of a HighQualSystem and we might associate repair- 
history restrictions with another concept such as HighReliabilitySystem. If we 
just wanted to have this kind of simple rule encoding, we would not need to 
use a separate mechanism. If one has an encoding scheme that includes nega- 
tion and disjunction (or some other way of encoding an “if-then” rule) , as do 
most of the modern Description Logic languages, then one does not need to 
introduce a separate rule notion. For example, one might encode a simple if- 
then relationship such as (or (not HighReliabilitySystem) GoodRepairHis- 
tory). This states that either something is not a high reliability system or 
it has a good repair history, which is typically viewed as equivalent to “if 
something is a high reliability system, then it has a good repair history” . 

The Description Logic that this example was encoded in (Classic 
[Borgida et ai, 1989; Brachman et ai, 1991; Patel-Schneider et al, 1991; 
McGuinness and Patel-Schneider, 1998]) had a rather limited set of con- 
structors and also had the simple rules introduced above and also more 
sophisticated rules such as those which compute role values based on con- 
text. In some configuration applications of this Description Logic, the more 
sophisticated rules in combination with other constructors have encoded 
expressive rule-based reasoning, and in fact many of the rules in those con- 
figuration system required Classic’s more sophisticated rule representation 
system. The examples we have seen in this chapter only use a simple form 
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of if-then rules. For a more detailed discussion of how powerful these rules 
can be in practice, see [Borgida et al . , 1996]. 

Description logics are not required of course in order to capture rule rep- 
resentation and reasoning; this example simply shows that they can be a 
convenient technique for capturing rules and reasoning with them. 


12.2.3 Active inference 

Description logics deduce logical consequences of information and are thus 
said to provide active inference. In fact, one of the typical patterns of infer- 
ence observed in many DL-based configuration systems includes: 

• asserting new information about an existing term 

• recognizing that the updated term is an instance of a class 

• firing a rule on the term that is associated with the class 

• propagating information from the updated term to related terms. 

For example, let’s consider MY-HTS again. Let it have a hasTelevision slot 
filled with a particular television TV-11. Once it is asserted that the user is 
willing to pay more than 8000 dollars for this system, it is recognized to be 
an instance of the HighQualSystem. The rules associated with that concept 
fire and now it becomes an instance of something that has a television di- 
agonal minimum of 50 inches (or possibly a high definition television with a 
smaller diagonal) and a television price of a minimum of 1000 dollars. These 
restrictions are propagated onto TV-11. 

This kind of deduction chain comprises over 50% of the inferences that 
are done in the stereo configurator example. In this manner, users only need 
to specify a small number of restrictions on their system and they can have 
a large number of deductions performed for them. 

It should be noted that this particular example configurator was built on 
a Description Logic that did not contain default reasoning. Some Descrip- 
tion Logics have been expanded to include default reasoning (i.e., if it is not 
known to be otherwise, use the default rule) [Padgham and Zhang, 1993; 
Baader and Hollunder, 1995a; Quantz and Royer, 1992]. For example, if a 
manufacturer has not been specified for a television, use Sony as the manu- 
facturer. If the underlying formalism had had a default representation, this 
would have been used. 

As the demonstration system was encoded, the stereo configurator used 
two sets of concepts on which to hang rules: a concept for all provably correct 
rules (such as power compatibility) and another concept for the default rules, 
called a “guidance” concept (for more subjective rules such as minimum 
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prices). The deployed configurators on which this system was based actually 
used defaults as completion - at a particular point in the specification input 
process, if information is unknown, then “complete” it using the “default” 
or subjective rules [McGuinness and Wright, 1998b]. This provided one 
very simple method of implementing a kind of “default” as completion that 
can be viewed as one of the simplest forms of default reasoning. 


12.2.4 Explanation 

Customer help desk staff need to be able to help users understand potentially 
everything about a configuration specification and the final parts list. In fact, 
the Prose family of configurators faced extinction had it not been able to 
respond with a full explanation capability. It was evident that consumers 
needed to be able to find out why some particular part was in their final 
system, why it had the particular value restrictions it did, what the possible 
alternatives were, and from what portion of the specification this information 
had been derived. In this simple example, a customer might want to find 
out why the television in the final system costs over 1000 dollars or why it 
has a particular minimum diagonal requirement. The explanation would be 
that a high quality system was requested and high quality systems include 
a suggested minimum diagonal size and a minimum price on their television 
components. 

The demonstration system allows customers to point to particular 
components and ask questions about everything that has been deduced 
about them. It also anticipates the most common explanation questions 
that users ask and provides pulldown menus containing explanation 
question that were dynamically generated based on the item a user 
was pointing to and that a user could just click on to ask quickly. 
An extensive explanation foundation was designed for the underly- 
ing DL-based system in order to support that [McGuinness, 1996; 
McGuinness and Borgida, 1995]. The explanation system provides a 
proof-theoretic foundation for explaining any deduction in terms of proof 
rules and arguments. It also provides an automatic followup capability that 
generates the questions that would lead to this inference being deducible. 
The followup question generation was found to be needed since user studies 
showed that users wanted fairly simple explanations along with the capabil- 
ity to ask followup questions. Further studies found that users appreciated 
help in generating syntactically correct followup questions that made sense 
given the previous question that was asked. The followup questions were 
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automatically generated from the model-theoretic form of the explanation. 

The basic explanation structure was originally done for a normalize- 
conrpare DL-based system but has since been used as the foundation for 
a tableau-based Description Logic [Borgida et al. , 1999] and also a model- 
elimination theorem prover in an implementation of the ATP and JTP the- 
orem proving systems at Stanford University. 

Explanation in general is one of the strengths of Description Logics as 
opposed to some of the other configuration approaches. It may be much 
more difficult to explain a line of reasoning in a typical constraint-based 
approach than it is to filter and prune an inference-rule-based theorem 
prover such as a DL prover. Filtering object presentations and explana- 
tions in Description Logic has also been addressed in [McGuinness, 1996; 
Borgida and McGuinness, 1996; Baader et al, 1999a]. Also, it has been ar- 
gued elsewhere [McGuinness and Patel-Schneider, 1998; Brachman et al., 
1999] that explanation is a requirement for many kinds of applications, but 
is particularly important for configuration systems [McGuinness and Wright, 
1998a]. 

Recent work has been done in constraint-based approaches that starts 
to address explanation in constraint-based configurators. While progress 
is being made, the more interesting constraint-based explanation systems 
[Freuder et al., 2001] utilize extensive domain specific information and are 
not generic solutions to the problem of understanding explanations. 


12.2.5 Evolving specifications 

In many common configuration scenarios, a user begins with an incomplete 
set of specifications for an end product. Configuration applications built to 
support users should take input of the known specifications (in an order that 
is convenient for the user and not just an order convenient for the program), 
and then solicit remaining required input. 

A configurator system should allow mixed initiative input, where the user 
may input the specifications the user is aware of at a particular time and 
the system should request input that it needs to meet a task. Description 
Logics can allow users to input descriptions of end products or individual 
components at any time. For example, in the home theater system, a user 
could specify information about the entire system - such as a requirement 
for the entire system to be high quality - and also could specify information 
about any of the particular components that was known at a particular time. 
The user might, for example, prefer to buy a particular model television 
or might want to set a diagonal size and a number of other constraints 
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on the television, but might not know anything at the moment about the 
restrictions on the DVD player. 

A user interface, such as the one depicted in the stereo example, allowed 
a user to choose components from dropdown menus. The dropdown menus 
were generated on the fly in order to take into account all of the information 
that the system currently had about a component. This was used as a query 
to the database of all components that met that specification. Thus, the 
user was kept from choosing many components that would be incompatible 
with the system that was configured to date. 

The user could also browse the current configuration and delete any re- 
quirements that were stated. (The user was not allowed to delete require- 
ments that were inferred, but was allowed to ask how a particular require- 
ment was deduced, thereby discovering the source of that requirement.) Once 
a requirement was deleted, then new dropdown menus were generated to in- 
clude components that met the current set of specifications instead of the 
previous set. 

This architecture provides a great deal of flexibility for incrementally 
evolving (sometimes non-monotonically evolving) specifications. It worked 
well to provide users with menus of choices that were recalculated as needed, 
with updated component lists that met the current specifications that were 
stated or implied about any component. 

For example, if a user specified a high quality stereo system and then 
decided to choose an amplifier for the system, the configurator would only 
present options for amplifiers that had been determined to be of high quality. 
Description Logics are not the only modeling scheme that supports evolving 
specifications, but this section attempts to point out that they can be used 
rather easily to support evolving configuration specifications. 


12.2.6 Extensible schemas 

Many configuration applications find that information about components 
is continually updated. It is not always the case that only the simple data 
about components is updated but sometimes properties of the components 
change or new properties are discovered after an application has been en- 
coded. Thus, it becomes important to work with a schema or a description of 
a component that can be updated. For example, in our home theater appli- 
cation, when we began development, DVD players were not in the consumer 
market. It later became common for home theater systems to include DVD 
players, and thus our schema needed to be extended with the new class - 
DVDPlayer - as well as with roles that were appropriate for DVD players. 
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This need for updatable and configurable schemas is sometimes a require- 
ment for design. For example, in AT&T evaluation of software, one criterion 
is extensible schemas. Our experience in the deployed Prose and Questar 
configurator family was that products were extended often in practice. 


12.2.7 Reasoning for incomplete information 

Many configuration specifications are almost by necessity incomplete when 
input initially. In large systems, it may be common for one person who may 
be an expert in one area to input specifications for that area while another 
person who is an expert in another area may update the specification later. 
For example, in a two-person household, one person may be much more 
literate in audio quality and thus may input the requirements for speakers, 
while the others may have more interest and knowledge in video displays 
and thus may input specifications for the television (along with its input 
and output requirements). It may be important to allow specification to be 
done across multiple sessions as well. 

One would not want a configurator that could not make deductions un- 
til all of the input requirements had been presented. For example, in the 
stereo system, one would want a configurator that could infer the implica- 
tions of the speaker restrictions on say minimum power requirements for the 
amplifier, even though the television specifications had not been input yet. 

Description Logics have been demonstrated to be useful at determining 
logical consequences of information even when that information is incom- 
plete. They can also be used to determine information that is still required. 
For example, they can determine that two speakers need to be input as parts 
in the parts list before the configuration can be considered complete. Thus, 
it is not enough to say that two high quality main speakers are required but 
the parts list actually needs to have the actual speakers chosen before the 
job is considered complete. 

In the home theater application, there was a one-pane display dedicated to 
showing which final component choices still remained before a configuration 
could be considered completed. The display could be used to view the current 
parts already implied and / or chosen along with the other components yet to 
be chosen. The other components could be clicked on to obtain the current 
description of the component so that a user could view what had been 
derived to date about that component. The application allowed a user to 
save a partial specification of a configuration for further requirements to be 
input at another point. The application also allowed a user to “complete” the 
configuration at any point that would force the system to make consistent 
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decisions for remaining underconstrained components. The user could also 
inspect individual component choices and click on them and see a pulldown 
menu list of alternative choices that the system could have made. The user 
could also click on the component and view a description of the constraints 
that the application had determined must hold for that component. The 
description of the component was what was used to query the knowledge 
base about components that would fit the characteristics. The description 
could also be passed along to another user (or another application) so that 
it could see what constraints had been deduced so far and then have that 
other user (or application) either add new constraints or make the ultimate 
product choice, thereby facilitating collaborative configuration. 


12.2.8 Inconsistency detection 

Configuration applications should minimize the chances for users to generate 
inconsistent specifications. The stereo configurator, for example, uses the 
information that can be deduced about any particular component in order 
to form a query to the database about possible components. This greatly 
limits the chances that a user may choose a component in their system that 
will cause an inconsistent specification to result. The deployed application 
did not take a greater step, however, before choosing to put a component on 
a pulldown list. It did not make the hypothetical choice of the component 
for the user and then check to see if the remaining components that were 
still unspecified could be completed with a component in the database. (Of 
course, this would be an exponential search with the remaining components 
yet to be specified.) Thus the deployed example could still allow a user to 
generate an inconsistent specification - the application just made it more 
difficult for this to happen. The back end reasoning system was required to 
determine when an incremental specification became inconsistent. 

Sometimes users of other deployed configurators generate a large set of 
constraints and want to input them into other (connected) configuration ap- 
plications. Thus one additional requirement on a user friendly configurator 
(that is expected to interact with other configuration applications) is for 
the reasoner to take input constraints and determine if they are inconsis- 
tent. The Prose configurator family, for example, supported batch input of 
requirements with consistency checking. 

Reasoners may choose different methods of handling inconsistencies. A 
requirement for a configuration system is that the underlying reasoner must 
be able to identify the inconsistency and notify the user. A helpful reasoner 
will also support users by allowing them to ask how the inconsistency was 
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deduced. The reasoner could also give users the option to “roll back” the 
specification to the last consistent state. For example, the Classic knowl- 
edge representation system required its information to be consistent, and 
thus once an inconsistency was detected, it disallowed the last statement 
that generated the inconsistency (maintaining a separate error state for de- 
bugging support) and then rolled back to the last consistent state. This was 
common for early DL-based systems. Today, however, Description Logics do 
not necessarily require consistent axioms to function. They may allow a set 
of inconsistent axioms to be input and then configurators can be built that 
utilize the Description Logic to identify if a description is satisfiable. This 
model of allowing inconsistent input with a user-identified checkpoint may 
be a model that supports collaboration and web-oriented development most 
naturally. 


12.2.9 Modularity 

In large systems, it is important to allow multiple people to work on spec- 
ifications in what appears to be a simultaneous environment. In Prose for 
example, care was taken to design a set of classes and roles that a number of 
developers could use. Multiple users were then allowed to work on specifica- 
tions of different portions of the configuration information simultaneously, 
with previously defined upper level classes and roles for their use in specify- 
ing more specific classes. When the users were finished with their particular 
component descriptions, loads were done to see if the different portions in- 
teracted. This model of individual users being in charge of specific portions 
of the ontology while possibly one chief ontologist is in charge of the upper 
level ontology is not uncommon. Cycorp, for example, publishes its upper 
level ontology which is maintained by a core Cycorp group while many other 
people develop more specialized mid-level ontologies. VerticalNet also has a 
number of ontologies with many different authors of specific ontologies that 
use an upper level ontology that is maintained by a core ontology team. De- 
scription Logics can be used to support such modeling, with Prose being 
an example of one such development. 

Another notion of modularity support can be considered with environ- 
mental support features. Some systems such as OntoBuilder [Das et al, 
2001] at VerticalNet have been built to support multiple users working on 
the same portion of an ontology in a more integrated manner. VerticalNet ’s 
system allows users to be notified if someone is modifying a portion of the 
ontology that they are using. While OntoBuilder does not have a DL back 
end, its input language is quite similar to OIL [Fensel et al, 2001] and thus 
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it is not a hard task to imagine that an ONTOBuiLDER-like system could be 
integrated with today’s DL systems. 


12.2.10 Maintainability 

Once systems are used for a long time period or are used enough so that they 
require support from someone other than their original author, maintainabil- 
ity becomes an issue. We have used examples from the stereo configurator for 
all of the other sections, but in this section we will draw from our experience 
with the Prose-Questar family of configurators. The stereo configurator 
has been up on the web for some years, yet it has not had many maintenance 
requirements because it is a demonstration system that is not updated when 
new stereo information becomes available. However, deployed configurators 
typically have help desk support and require data (and sometimes schema) 
updates. 

There are at least three components of maintenance that require some 
thought when planning a configurator: 

• product data updates 

• product specification updates 

• help desk support. 

The first is the simplest. Typically, product data requires updates over 
time. Simple things like prices and availability need updating and sometimes 
small updates are made with revisions. Typically, this kind of information 
is not hard to update - someone who does not know much about the en- 
coding can typically find a way to do things such as updating price fields in 
many applications, whether they are DL-based or not. Description Logics 
support this requirement since they are aimed at working with incomplete 
information (e.g., Subsection 12.2.7), and thus updates from incomplete to 
more complete information are natural for DL-based systems to handle. Sim- 
ilarly, an object-oriented modeling scheme may make updates simpler, but 
this area alone would not be enough to drive a potential user to a DL-based 
approach. 

The second issue of updates to product specification might be viewed by 
a database designer as a schema update. This kind of information is typ- 
ically more challenging to update in applications since it requires product 
specification descriptions and not just simple, data changes. It could be 
simple, requiring say a change to the range of a field; for example, possibly 
an age range may move from 18-65 to 18-70. Similarly, a business that used 
to accept only US currency may now accept other currencies, such as euros, 
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requiring updates to the price field value restriction. More complicated 
product specification updates may be done when new components become 
available (thus requiring someone to model the new components and their 
features). These types of specification updates are facilitated in Description 
Logics by the kinds of features that we noted in Subsections 12.2.6, 12.2.5, 
12.2.1, and 12.2.9. 

The third issue of help desk support has been noted as a strength of DL- 
based systems. One of the goals with the Prose configurator systems was 
to allow the help desk personnel to appear to perform at a level above the 
amount of training they had on individual products. The enabling infras- 
tructure toolset was to provide information to the help desk staff at the 
time they needed it in real time (instead of requiring them to have been 
previously trained on products so that they could answer questions from 
knowledge that they had learned instead of from knowledge that they could 
look up on demand). 

The tools were to allow them to explain any of the deductions that the 
system made when customers called in asking why something was (or was 
not) in their configuration and also allowed them to answer questions about 
why configurations were (or were not) valid. This was most facilitated by 
the functionality described in Subsection 12.2.4 but also by others such as 
Subsection 12.2.8. Similarly, they could answer hypothetical questions such 
as “what would happen if I chose component X instead of component Y in 
my configuration?” The goal was to meet individual customer needs without 
requiring engineering support to answer such questions. Our claim is that it 
is a combination of the strengths of Description Logics as discussed in the 
previous sections that helps support maintainability of the applications and 
in fact, helps support maintainability by people who have not taken classes 
in Description Logics or knowledge representation. 


12.3 The Prose and Questar family of configurators 

The longest-lived and most prolific family of DL-based configurators has 
been the Prose and Questar product line [Wright et al, 1993; McGuin- 
ness and Wright, 1998b]. AT&T began development on configuration prob- 
lems in 1988 in response to business requests for help in the streamlining 
of the Engineer, Furnish, and Install process. The goal in the process is to 
solicit a specification request from the customer through the sales process, 
and then engineer a solution that can be “furnished” and of course manufac- 
tured and delivered to the customer in a timely and cost effective manner. 
The initial goals of the project were to decrease the time from specification 
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to installation and to minimize the impact of contradictions in the specifi- 
cations and mistakes in the engineering. The initial configurator was built 
for a fiber optic transmission system (the FT Series G) although the initial 
deployment was for a digital cross-connect system (the DACS IV-2000). 

The initial configurator was successful enough that a family of config- 
urators was built around it. The history of the project proceeded from a 
research involvement to one of development. AT&T’s research division col- 
laborated with developers in order to build the initial system. Researchers 
helped generate and critique the initial conceptual models and program- 
ming effort. Developers generated the initial system but with interactive 
assistance from research. As the product evolved, project needs emerged for 
developer independence and an environment was produced that allowed do- 
main knowledgeable people to input configuration rules in a language that 
was comfortable to them. Developers had the lead responsibility; in the ini- 
tial deployment they had the assistance of research but in the second through 
seventeenth system they required little assistance from research for either 
generation or maintenance of individual configurators. As the development 
environment evolved, the developers saw much less of the DL back end - 
essentially the DL back end verified input and deduced conclusions and was 
otherwise hidden behind the interface of the system. 

There are a few points worth noting about this family of applications. 
First, the configurator family has shown longevity with some configurators 
deployed a decade after work began. Second, the majority of the generation 
and maintenance of the configurators was done by people who knew very 
little about Description Logics (thus showing empirical evidence that appli- 
cations do not require PhDs in Description Logics to build and maintain 
them). An evolution interface was developed by domain literate developers 
aimed at users who knew the products but did not know Description Logics 
or sometimes computer science at all. This interface allowed users both to 
maintain configurators and also to generate new configurators in the same 
product family. Third, there is a consensus that the DL-based approach both 
facilitates conceptual modeling (e.g., [McGuinness and Wright, 1998b]), and 
also makes maintenance much easier. Ford Motor Company has also stated 
similar findings with its long-lived DL-based configurator applications. 


12.4 Summary 

We have introduced the problem of configuration, describing briefly the na- 
ture of the problem and why many communities consider it important. We 
have described properties inherent in the problem that make it an area for 
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which one might consider DL-based approaches. We have provided exam- 
ples of all of properties in the setting of a stereo configurator, mentioning 
how a DL-based approach was used to solve the problem. We made parallel 
connections to the much larger configurators used for telecommunications 
equipment that also included the same issues and had DL-based solutions. 

We have also introduced the largest family of DL-based configurators - 
the Prose-Questar family of systems (noting also that at least one other 
commercial configurator at Ford Motor Company also has a similar life- 
span and a similar Description Logic-based approach). We observe that the 
Prose-Questar configurator family has been in continuous use for over 
a decade and has configured billions of dollars of equipment. We finally 
note that the commercial configuration examples with long histories state 
that the DL approach has alleviated the problems of conceptual modeling 
and configurator maintenance. Additionally, we speculate that this general 
architecture that meets the list of configuration needs might also be used in 
problem areas with similar needs. 
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Abstract 

Description Logics and related formalisms are being applied in at least five 
applications in medical informatics - terminology, intelligent user interfaces, 
decision support and semantic indexing, language technology, and systems 
integration. Important issues include size, complexity, connectivity, and the 
wide range of granularity required - medical terminologies require on the or- 
der of 250,000 concepts, some involving a dozen or more conjuncts with deep 
nesting; the nature of anatomy and physiology is that everything connects 
to everything else; and notions to be represented range from psychology to 
molecular biology. Technical issues for expressivity have focused on problems 
of part-whole relations and the need to provide “frame-like” functionality - 
i.e., the ability to determine efficiently what can sensibly be said about any 
particular concept and means of handling at least limited cases of defaults 
with exceptions. There are also significant problems with “semantic normal- 
ization” and “clinical pragmatics” because understanding medical notions 
often depends on implicit knowledge and some notions defy easy logical 
formulation. The two best-known efforts - Open Galen and Snomed-rt - 
both use idiosyncratic Description Logics with generally limited expressiv- 
ity but specialized extensions to cope with issues around part-whole and 
other transitive relations. There is also a conflict between the needs for 
re-use and the requirement for easy understandability by domain expert au- 
thors. OpenGALEN has coped with this conflict by introducing a layered 
architecture with a high level “Intermediate Representation” which insu- 
lates authors from the details of the Description Logic, which is treated as 
an “assembly language” rather than the primary medium for expressing the 
ontology. 
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13.1 Background and history 

13.1.1 Knowledge representation in medical applications 

Description Logics and related frame-based and conceptual graph formalisms 
are being applied in at least five applications in medical informatics: 

• Terminology development and, more broadly, the representation of information 
in health records. 

• Intelligent user interfaces. 

• Decision support and semantic indexing. 

• Semantics-oriented natural language processing. 

• Semantic integration of information systems. 

The seminal early work in the use of Description Logics in medical ap- 
plications focused on the dilemma between expressiveness and tractability. 
Doyle and Patil [1991] attempted to apply Nikl to medical vocabulary and 
came to the firm conclusion that the Nikl TBox language was too restrictive 
to be useful for this purpose. More explicitly they despaired of users accept- 
ing the restrictions of minimally expressive TBox languages and predicted 
that users would find “work-arounds” which defeated the logical rigor which 
was their raison d’etre. A first attempt at a more appropriate representation 
was made by Jang and Patil [1989]. 

However, as providing a standard controlled medical vocabulary came to 
be seen as one of the central issues of medical informatics, some researchers 
saw “compositional systems” as the only plausible route forward. The per- 
ceived urgency of the task motivated “pragmatic” approaches. Masarie 
et al. [l99l] used a large frame-based AI environment to produce an “in- 
terlingua” linking three of the then current terminologies in one of the ex- 
ploratory projects to what became the Unified Medical Language System 
[Evans, 1987]. 

Although the US National Library of Medicine chose to use lexical meth- 
ods to cross map existing terminologies rather than to develop Masarie’s 
approach to a logical interlingua, the project gave rise indirectly to the 
CANON group who became strong advocates of formal representations in 
medical terminologies [Cirnino, 1994; Evans et al. , 1994], A special issue of 
the American Journal of Medical Informatics (volume 1, issue 3) summarised 
the material from its seminal workshop. 

The CANON group brought together several other strands of then current 
work: 

• The Medical Entities Dictionary developed by Cimino et al. [1989] as a large 
semantic network. 
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• The related Galen [Rector et al., 1993; Rector and Nowlan, 1994] and Pen&Pad 
[Nowlan et al., 1991a; 1991b; Nowlan and Rector, 1991] programs from Europe. 

• A series of projects on the use of Sowa’s conceptual graphs for representing med- 
ical vocabularies, the best known of which is the one by Campbell et al. [1994] 
but the series includes also work by Bell et al. [1994]. 

In addition, the group interacted with more linguistic work by Friedman 
et al. [1994] and Sager et al. [1994] which, along with Tuttle [1994], served 
as a contrast and a reality check. 

There have been two large scale outcomes of this work: 

• The SNOMED-Reference Terminology (Snomed-rt) and SNOMED-Clinical Terms 
(Snomed-Ct) projects under the College of American Pathologists, 1 which seek 
to produce a terminology all of whose concepts are represented in a subset of Krss 
and formally classified, and which were released at the end of 2000 [Spackman 
et al., 1997]. A further cooperation with the UK Clinical Terms project is to 
produce an international version to be released in 2002. 2 

• Open Galen, which seeks to produce a reference ontology in a specialized Descrip- 
tion Logic for use in developing and managing other terminologies and indexing 
knowledge required for decision support, user interfaces and other knowledge 
management tasks. 3 

In addition there have been a number of projects on language processing 
in medicine which have included significant work on formal knowledge rep- 
resentation, particularly the work by Hahn using Loom [Hahn et al., 1999a; 
1999c], which has produced a range of large scale results in both language 
engineering and ontologies proper, and by Zweigenbaum using a specially 
restricted frame representation in a similar way [Zweigenbaum et al., 1995]. 
Another important task is the indexing and retrieval of medical literature, 
which has been addressed by McGuinness [1999]. 

Applications of ontologies within medicine, not based on Description Log- 
ics, include the work by Musen [1998] on re-usable problem solving meth- 
ods and ontology driven knowledge acquisition in the Protege project 
which, at least so far, has specifically not used a Description Logic or 
other formal basis for its ontology, but rather based its ontologies around 
the OKBC and DAML standards. As these standards are converging 
with Description Logics in OIL and DAML+OIL [Fensel et al., 2001; 


1 http://www.snomed.org/ 

2 http : //www. coding. nhsia.nhs .uk/ 

3 http://www.opengalen.org 
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Horrocks and Patel-Schneider, 2001], convergence with Protege is under 
active discussion. 

Stefannelli and Schreiber likewise have produced a body of work based 
around adaptations of the KADS architecture using ontologies as the ba- 
sis for intelligent systems and agent architectures [Schreiber et al., 1993; 
Vanheijst et al, 1995; Falasconi et al, 1997]. 

Another major effort on knowledge representation in medicine is the 
Digital Anatomist project [Rosse et al, 1998; Agoncillo et al, 1999; 
Mejino and Rosse, 1999], which currently does not use a Description Logic 
but which represents a benchmark for a comprehensive, carefully curated 
and validated knowledge base based on carefully analyzed ontological com- 
mitments and distinctions manifest in a meticulously defined hierarchy of 
high level concepts such as “organ” , “tissue” , etc. It poses a challenge to any 
system aspiring to a comprehensive representation of medical knowledge. 


13.1.2 The medical environment 

Behind most of these applications is the aspiration to re-use clinical data - 
either to integrate systems, to link patient records to decision support and 
knowledge management, or to re-use information collected in the course of 
patient care for management, remuneration, quality assurance or research. 

There has been a widespread move to greater integration and to “Elec- 
tronic Patient Records” (EPRs), also known variously as “Computer based 
Patient Records” (CPRs) or (CBPRs). The goal behind these moves is three- 
fold: 

• To improve patient care through providing better information on current pa- 
tients, warnings, and decision support to healthcare professionals - e.g., to be 
able to identify patients’ known problems and treatments, warn of potential drug 
interactions and contraindications, or suggest management based on established 
guidelines. 

• To capture improved information for planning and management within healthcare 
institutions by re-using information collected at the point of care for all secondary 
functions - e.g., to re-use diagnosis and treatment information collected during 
patient care for statistical reporting, quality assurance, and remuneration. 

• To integrate the disparate information systems typical of most healthcare insti- 
tutions. 

Major reports justifying electronic patient records have been issued, 
amongst others, the Institute of Medicine [Dick and Steen, 1991], the Com- 
puter based Patient Record Institute (CPRI), and the UK National Health 
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Service [NHS National Health Service Executive, 1998]. This pressure is in- 
creasing with moves to greater clinical accountability and concern with clin- 
ical errors [Kohn et al . , 2000]. That every patient should have an electronic 
medical record is now government policy in a number of western countries, 
including the UK and US. 

Despite the widespread use of management, billing, and laboratory sys- 
tems in medicine, the vast majority of the information required for such 
medical records currently exists only as unstructured narrative text. Cap- 
turing more of this information in structured form is a central task of medical 
informatics. The absence of a standard “controlled vocabulary” or “coding 
system” is seen as a major barrier to this task [Sittig, 1994] while its pres- 
ence is a key to its success [Rossi Mori and Consorti, 1999]. Hence several 
countries have mandated, or will soon mandate, standard terminologies for 
use in medical records. 

However, most existing terminologies or “coding systems” are mono- 
hierarchical classifications developed either for public health reporting (the 
International Classification of Diseases - ICD) or bibliographic retrieval (the 
Medical Subject Headings - MeSH). They are much too coarse grained for 
recording care of individual patients. Attempts to extend them to make 
them finer grained have run into combinatorial explosions with some sys- 
tems now running to over 250,000 “terms” which are beyond manual main- 
tenance. Their structure is largely implicit, and writing software to use 
them is therefore problematic. An alternative faceted system, Snomed- 
International, has existed for some time, but has no strong semantics 
defining the relationships amongst the facets and has always been con- 
sidered difficult to use outside its origin in Pathology - both because 
of its unfamiliar structure and an organization which reflects its origins 
in pathology and often does not cater for the needs of other medical 
specialities. 

The US National Library of Medicine has mounted a major program to 
tame this chaos in its Unified Medical Language System (UMLS) which cross 
maps, insofar as possible, all of the general and special purpose vocabular- 
ies [Lindberg et al . , 1993]. It has developed into a massive (15 Gb) cross 
reference and cataloging system. 4 However, although cross referenced, the 
Unified Medical Language System is fundamentally limited by the nature of 
the underlying systems which it cross maps. It itself provides only a minimal 
amount of additional semantic information - less than 200 categories in a 
loose semantic network. 


http : //umlsks . nlra . nih . gov/ 
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Hence the hope by various researchers that DL-based ontologies can pro- 
vide a better solution for at least some of the problems of terminology, 
decision support, language processing and integration. 


13.2 Example applications 

13.2.1 Description Logics in terminology development and 

“ coding ” 

13.2.1.1 Snomed-rt: tightly coupled development and pre- coordination 

Snomed-rt is a cooperative enterprise between the College of American 
Pathologists and Kaiser Permanente, a large health maintenance organi- 
zation. It has re-represented in a subset of Krss the information in the 
SNOMED-International. In a first approximation, the Snomed facets for 
anatomy, morphology, function, etc. have been turned into roles, hasTopog- 
raphy, hasMorphology, etc. [Campbell et al, 1998]. The initial mechanical 
translation has then been re-modeled in place by domain experts using a 
set of tools with a highly developed change management mechanism [Camp- 
bell, 1998]. The development methodology has placed a high emphasis on 
achieving repeatability of domain experts’ results, and made extensive use 
of lexical tools to suggest additional relationships which are implied by the 
rubrics but may not be explicitly present in the faceted representation; for 
example the term “retinal vasculitis” was correctly related to “eye” but 
not to “vasculitis” (inflammation of the blood vessels) in early versions of 
SNOMED-International [Campbell et al., 1996] 

The first released version consists of a pre-enumerated set of 180,000 or 
more disease and procedure codes, each defined in an ontology represented in 
Krss and classified accordingly into an acyclic directed graph. The intention 
appears to be a standard pre-coordinated (i.e. , pre-defined) set of concepts 
and associated terms to be presented and used in a form analogous to that 
of traditional hierarchical coding schemes. 

Recently a collaboration has been formed between Snomed-rt and the 
UK Clinical Terms (Read Codes) project to produce a combined prod- 
uct which is aimed at being a standard English controlled vocabulary for 
medicine. Details have not yet been announced, but it is assumed that the 
form will be closely related to that of Snomed-rt. 

The ontology used is relatively shallow, including under ten roles in its pre- 
release version, and avoiding embedded expressions wherever possible. How- 
ever, the standard semantics of Krss has been enhanced by the inclusion of 
right-identities to cater for part-whole relations (see Subsection 13.3.2). 
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Snomed-rt itself includes no tools or transformations for data entry or for 
other applications involving dynamic post-coordination. However, a range 
of tools based on Snomed-rt, including the authoring suite, is available 
from the company that supplies the development tools (Apelon 5 ), which 
are descended in part from K-Rep, a DL-style KR system used in many 
of the early experiments which led up to the project [Mays et al., 1991a; 
1996]. 

13.2.1.2 Galen : loosely coupled development and post-coordination 
Galen is the result of a series of European Commission funded projects and 
its ontologies and specifications as well as some of the tools are available in 
open source form from http : //www . opengalen . org/. 

The Galen tools are designed for loosely coupled development, and 
the ontology is aimed primarily at post-coordinated applications, such as 
intelligent user interfaces, and tools to empower users to adapt core termi- 
nologies to their specific needs. It is based around the idea of a dynamic 
“terminology server” rather than an enumerated table of pre-coordinated 
terms [Nowlan et al., 1994; Rector et al., 1995a], although there is a limited 
set of common concepts pre-defined. 

An important feature of Galen is the clean separation of functions within 
the server architecture: 

• logical representation in the Description Logic; 

• language generation and text recognition; 

• mapping to and from existing coding systems; 

• indexing of non-terminological information; 

• additional calculations such as unit and coordinate transformations. 

Galen’s ontology was created de novo but with close reference to the 
standard classifications, particularly the International Classification of Dis- 
eases. It uses the Grail Description Logic [Rector et al., 1997] whose core 
includes the subset of operations of the Krss used by Snomed-rt, including 
transitive roles, with the addition of inverse roles and role subsumption. (See 
Subsubsection 13.3.2.2 for a further discussion of transitive roles and related 
issues.) In addition Grail provides a construct “sanctioning”, analogous to 
slot definitions in frame systems or function signatures in object-oriented 
systems, which supports answering queries of the form “what can be said 
about this?” Grail is implemented using a graph comparison algorithm 
which, although known to be incomplete, has still proved to be extremely 
useful in practice. 

5 http://www.apelon.com/ 
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Galen’s most distinctive feature is the use in authoring tools for domain 
experts of a much simplified “intermediate representation” which is then 
translated into the Description Logic, which is relegated to the status of an 
“assembly language” (see Subsection 13.5.1 below). 

The Galen project has also devoted much effort to mapping to existing 
coding systems - a more complex task than is at first apparent because 
of the idiosyncratic construction of the target schemes. Each code in such 
schemes is mapped to the disjunction of one or more Galen concepts. 
A Galen concept is taken as being mapped to the most specific code 
mapped to a subsuming concept, and conversely, a code is mapped to all 
those Galen concepts subsumed by its mapping except those subsumed 
by a more specific mapping. This mechanism deals with almost all of the 
complex sets of exclusions and inclusions in the International Classification 
of Diseases - e.g., “Hypertension excluding hypertension in pregnancy” is 
coped with automatically simply by mapping to the general concept “Hy- 
pertension” , because there is a mapping to a specific concept “Hypertension 
in pregnancy” which will cause it, and its descendants, to be excluded 
automatically. In the very few cases where conflicts occur they are resolved 
by separate exception-handling tables. 

A similar mechanism provides a surrogate for inheritance with exceptions 
as a means of indexing information ranging from triggers for decision support 
rules to data entry forms and user interface specifications. Any information 
may be labeled and attached to the ontology, and the server provides op- 
erations to retrieve the set of all the values “inherited” . The Galen server 
makes no attempt to reduce the set to a single value; if required this is a 
matter for the client application. 


13.2.2 Description Logics and language processing 

13.2.2.1 Language analysis and information extraction 

Most medical information originates and is stored as natural language text. 
Medical texts present classic “sublanguages” with peculiarities of vocabulary 
and syntax. Many utterances are telegraphic or highly elliptical, and cannot 
be easily parsed without semantic knowledge. These features seem natural to 
combine with lexicalized grammars in which most or all syntactic informa- 
tion is stored with the lexical item rather than in a separate grammar, e.g., 
Tree- Adjoining Grammars (TAG) [Joshi, 1994], Lexical-Functional Gram- 
mar, and Combinatory Categorical Grammar (CCG) [Steedman, 1996]. 6 


However, it should be noted that the classic medical natural language work, the Linguistic 
String Project [Sager et al ., 1987; 1994], while it makes extensive use of semantics, makes no 
use of ontologies or related mechanisms. 
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Hahn’s work on medSYNDiCATE [Hahn et al, 1999a], provides a de- 
tailed example using a specially constructed ontology in Loom. The 
medSYNDiCATE architecture features close coupling of the ontology (“Do- 
main knowledge base” ) with the parser and extensive use of learning tech- 
niques to deepen and extend both the ontology and the grammar. It uses the 
integrity conditions, and conceptual constraints, and cardinality restrictions 
in the ontology to reduce ambiguity and select plausible interpretations. It 
makes use of knowledge within the ontology to complete ellipses within the 
original text - e.g., to know that the connection between a gland and its 
product is “secretes” . It also makes extensive use of partonomic information 
using a unique approach discussed in Subsection 13.3.2.3 below. 

Rassinoux and Baud have used the Galen ontology to augment a strongly 
semantic approach likewise to constrain ambiguous or incomplete parsings 
[Baud et al, 1993; Rassinoux, 1998]. Zweigenbaum has used a restricted 
application specific ontology to similar purpose [Zweigenbaum et al., 1995]. 

Ceusters, by contrast, attempted to use natural language processing to 
understand the text attached to codes (the “rubrics”) to build and make 
mappings to the Galen ontology. Ceusters’ work was based on a range of 
pre-existing tools and experienced significant difficulty because of serious 
differences in the information-processing-oriented ontology developed by 
Galen and the language-oriented ontologies which underlay his tools. For 
example, the distinctions between location and part-whole relations and the 
distinctions amongst different part-whole relations have no direct linguistic 
counterpart. An adaptation of the Galen Intermediate representation was 
used to bridge this gap, but with only partial success [Ceusters and Spyns, 
1997; Ceusters, 1998; Ceusters et al, 1999]. 


13.2.2.2 Language generation, user interfaces, and quality assurance 

Any ontology intended for use by domain experts presents a problem of qual- 
ity assurance, or curation, by those experts. Any post-coordinated use of an 
ontology also presents a serious problem for the user interface - standard DL 
expressions are not acceptable for most uses by most domain experts. Even 
if they are simplified to an “intermediate representation” or transformed to 
conceptual graphs, the complexity is too great for most domain experts to 
take in quickly. 

One way to make such expressions accessible to users is to generate 
language expressions from them. Not only are the language expressions 
more readable, they are usually much more compact. Galen has found 
language generation to be essential in virtually all applications involving 
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post-coordination including most approaches to independent quality assur- 
ance of the ontology. 

One of the major applications of Galen technology has been by the 
French government to produce unambiguous definitions for their new na- 
tional classification of surgical procedures. Curiously, in this application, 
the usual language generation goals of concise idiomatic expression do not 
apply. The value of the technique is its pedantic, but completely unambigu- 
ous, presentation of the underlying formal definitions. Once the definitions 
are agreed and quality assured, idiomatic “preferred terms” can be composed 
manually where required [Baud et al., 1997; Rodrigues et al, 1997]. 


13.2.3 Decision support, indexing, and re-usable ontologies for 

problem solving 

Many decision support methodologies, notably Musen’s Protege and 
Aeon [Tu et al, 1995; Musen et al., 1996; Musen, 1998; Grosso et al, 
1999] and Stefanelli’s Games [Schreiber et al, 1993; Vanheijst et al, 1995; 
Falasconi et al., 1997], are based around the existence of a domain ontology, 
but in general the ontologies are constructed specifically for one applica- 
tion and have proved less re-usable than the problem-solving methods they 
support. Both use ontologies primarily as frame systems. 

A more specific use of the classification reasoning in Description Logics is 
provided by Galen’s work on drug ontologies carried out in collaboration 
with the Prodigy project on computerized guidelines for prescribing in UK 
general practice [Johnson et al., 2000]. Traditional classifications for diseases 
and drugs have only a single axis of generalization which conflates several dif- 
ferent criteria. For example, standard drug classifications conflate indication 
(e.g., for “treatment of asthma”), molecular effects (e.g., “stimulates alpha 
adrenergic receptors”), physiological effects (e.g., “dilates the airways”) and 
chemical structure. As result, even simple generalizations such as “steroids 
reduce inflammation” are difficult to operationalize using the classification 
because various steroids may be classified in many different ways - under 
anti-asthnratic drugs, topical skin preparations, anti-rheumatic drugs, etc. 

Separating the conflated axes and then using them as the basis of formal 
descriptions which can be classified by a Description Logic offers a potential 
solution. After early prototype demonstrations [Solomon and Heathfield, 
1994], Galen is now being used to construct an ontology of drugs and related 
conditions to be used as part of the Prodigy project, a system of protocols 
for prescribing for patients with chronic diseases which is being developed 
by the UK Department of Health [Solomon et al., 1999; Wroe et al., 2000]. 



446 


A. Rector 


Experience to date suggests that the ontology provides efficiently precise 
indexing at the varying levels of granularity required and can provide a 
framework for the necessary default reasoning via the mechanisms described 
in Subsection 13.2.1.1 for coding. Further evaluation awaits the next phase 
of the project. 


13.2.4 Intelligent data entry 

Data capture is the largest single barrier to greater information use in health- 
care. Galen developed from the Pen&Pad project [Nowlan et al., 1991a; 
1991b] which aimed to improve user interfaces for healthcare profession- 
als and which placed particular emphasis on data entry by attempting to 
construct forms which would capture most, if not all, of the information 
currently recorded as narrative text. 

The ontology provides two services in Pen&Pad - both related to the 
question “What can be sensibly said in this situation?” : 

• Indicating how a given concept could be refined by modifiers. 

• Indexing the form associated with each starting concept - often a disease or 
a symptom. Each such form may contain numerous subforms allowing further 
refinement of a concept or inclusion of further less common signs and symptoms. 

The total number of forms required to provide a clinical interface is very 
large - certainly hundreds of thousands and possibly more. The goal of the 
system is to assemble forms dynamically from the indexed “recipes” in such 
a way that it would fail soft - i.e. , that forms for important frequently 
encountered situations could be highly tailored at a very fine granularity 
whereas rarely encountered areas could be served by a form related only 
to the broad class of condition. In its commercial version, Clinergy™, a 
knowledge base of under 10,000 concepts and a similar number of auxiliary 
facts and forms specifications covered essentially all data entry for British 
general practice - a task requiring several hundreds of thousands of forms. 

Related systems were developed by Poon and Fagan [1994] and 
Lussier et al. [1992], using conceptual graph representations of Snomed- 
International. 


13.2.5 Integration 

A major ostensible goal for common terminologies in medicine is system 
integration [Evans et al., 1994; Rector et al., 1995b; Spackman et al., 1997]. 

See http://www.galen-organisation.com/furthertut.html for further information. 


7 
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While specialized terminology systems are being used in a few places as 
part of an enterprise wide effort at integration [Rocha et al. , 1993; 1994; 
Cirnino et al., 1998], ontologies based on Description Logics have yet to be 
demonstrated convincingly in this context. Much of the reason for this is 
the sheer scale and coverage required for such mediation tasks. 


13.3 Technical issues in medical ontologies 

13.3.1 Issues of scaling 

13.3.1.1 Size 

The fundamental issue in any medical ontology intended to capture clini- 
cal terminology is scale. The smallest useful medical terminologies contain 
on the order of 10,000 concepts; “comprehensive” terminologies require on 
the order of 250,000 or more concepts. The OpenGALEN model of basic 
anatomy alone contains over 5000 concepts, the model of surgical proce- 
dures some 15,000. Snomed-rt currently has some 180,000 concepts, and 
the combined Clinical Terms (Read Codes) Snomed-ct expects to have sub- 
stantially more. The Unified Medical Language System has issued nearly a 
million “Unique Concept Identifiers” with over a million lexical variants. 

13.3.1.2 Connectivity 

Medical ontologies are notoriously highly connected. Most medical concepts 
depend on anatomy, and every anatomical structure is ultimately connected 
to every other, at least trivially, by virtue of being part of the body. The 
causal and functional interrelationships are of similar density. Snomed-rt 
reduces connectivity by omitting inverses. Grail supports role inverses and 
transitive roles, but Galen’s ontology explicitly avoids expressions of the 
form “A which is part of B which has part C”, for which the classifier is 
known to be incomplete. It is not known whether complete and decidable 
reasoning for a Description Logic including role transitivity and inverses 
is practical for a large scale comprehensive medical ontology: some form 
of heuristic constraint on the depth or computational resources used for 
individual inferences may prove necessary. 

13.3.1.3 Range of granularity or organization 

Common medical notions span the range from the molecular to the physio- 
logical to the behavioural. To form a truly re-usable framework for medical 
knowledge representation, the ontology needs to encompass concepts such 
as “substances which cause mood change and tremor by binding to specific 
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receptor sites” . If the promise of “genomics” is to be realized, this may soon 
need to be extended to include concepts which add . . by stimulating the 
expression of a genetic sequence homologous to some specified allele in some 
reference source” . 

13.3.1.4 Complexity of concepts to be represented 

The areas of medicine most resistant to traditional manual terminologies 
and therefore most ripe for formal representation tend to include very com- 
plicated concepts. For example, a not untypical surgical procedure rubric to 
be represented might be “Removal of the gall bladder using an endoscope 
inserted via an abdominal incision” or “Fixation of fracture of the femur 
by means of insertion of pins” . More complex rubrics may go on for several 
lines in their natural language formulation. The full expansion in a Descrip- 
tion Logic may include several dozen conjuncts nested five or six levels deep. 
This complexity is not an academic artifact; these are the categories used 
to determine payment, quality of outcome, and prognosis. 

13.3.1.5 How much to represent - detail of the ontology 

Snomed-rt has a relatively simple ontology with less than ten roles. The 
Galen ontology is relatively complex, with some fifty roles, including seven 
different partonomic roles, and sharp distinctions between two-dimensional 
and three-dimensional objects. The Digital Anatomist appears to be a rep- 
resentation of similar complexity to Galen’s anatomical representation. 
At the extreme, Gangemi et al. [1996] have produced a high level ontol- 
ogy which claims strong philosophical grounding but is yet more elaborate. 
How much of this complexity is required for which purposes is still not 
established. 


13.3.2 Issues of expressivity: part-whole relations 

13.3.2.1 Transitivity and anatomy 

A large fraction of all medical terminology is based on anatomy and de- 
pendent on part- whole relations. “Fracture of foot” must be classified as 
“trauma to lower extremity” , “repair of the aortic valve” must be classified 
as an “operation on heart” , etc. 

Conflation of part-whole and IS-A relations is ubiquitous in informal clin- 
ical classifications and thesauri [Rector, 1998]. In general this works because 
for the key locative attributes it is, in general, true that a disease of the part 
is a disease of the whole and a procedure on a part is a procedure on the 
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whole. This is closely related to Cyc’s TRANSFERS-THRO notion and to 
some frame systems’ notion of inheritance of certain slots via relations other 
than IS-A. 

13.3.2.2 Galen ’s specialisedBy axioms and Snomed-rt’s 
right identity axioms 

All medical ontologies must face this problem in one way or another. Galen 
allows axioms equivalent to R o S C R (R specialisedBy S in Grail nota- 
tion). Snomed-rt allows the declaration that S is a right identity for R, 
which appears to be equivalent [Spackman, 2000]. 

Hence if R is hasLocation and S is isPartOf, then 

EhasLocation.(EisPartOf.Heart) C ElhasLocation. Heart 

where hasLocation is the relation used to link lesions and diseases to anatomy. 
Given axioms such as that 

AorticValve C EisPartOf.Heart, 

the required inferences that lesions of the aortic valve are lesions of the heart 
follows, i.e. , it can be inferred that 

ElhasLocation. AorticValve C EhasLocation. Heart. 

There are, in practice, a variety of other situations in which this construct 
seems essential, for example to say that the “risk of a syndrome involving a 
disease” is subsumed by a “risk of the disease itself” . 

Galen also makes extensive use of the implication of such axioms for the 
inverse roles, i.e., S _ oR“ C R~. For example, let S be isSubProcessOf and R 
be isActedOnBy, then S“ and R^ are hasSubprocess and actsOn respectively. 
The implication of such an axiom for the inverse roles then allows us to 
express the rule that surgical procedures can be said to act on all those 
structures acted on by their subprocedures, e.g.: 

EhasSubprocess.(EactsOn.FemoralArtery) C EactsOn.FemoralArtery. 

This is a practical example. The femoral artery is the usual route by 
which the heart is catheterized. Without such inferred subsumptions, cardiac 
catheterization would not be found as a target for the procedure - e.g., by a 
decision support system seeking to identify possible causes of damage to the 
femoral artery. Numerous parts of the classification of surgical procedures 
depend on such inferences. 
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The Grail language allows chains of such axioms, which can imply com- 
plex paths. Such axioms also interact strongly with the role hierarchy. Re- 
representing these paths as regular expressions of roles taking into account 
the role hierarchy is a current topic of research. 

13.3.2.3 The “triples” approach 

Hahn et al. [1999c; 1999b] have developed an alternative representation for 
partonomic relations based on what they have termed “SEP-triples” , which 
captures much partonomic reasoning within a framework compatible with 
the standard ACC Description Logic. In the SEP-triple formulation, each 
anatomical part X is represented by a parent concept X s , and two subsumed 
concepts X e and X p . X e represents the entity as a whole, and X p the concept 
of its parts. For all parts Y of X , X p subsumes Y s , and since Y s subsumes 
both Y e and Y p , both the entire part Y e and all of its parts Y p are subsumed 
by the parts of X. 

Y p X p X s 

X p C 3anatomicalPartOf.X e . 

This captures the transitive relation, i.e., that any part of Y is a part of X. 

For invariant anatomical relations, a separate existentially qualified role 
called hasAnatomicalPart links X e to Y e . 

X e C 3hasAnatomicalPart.y e . 

This scheme allows Hahn to capture the notion that something is always 
part of the whole if it is present, but that it may not necessarily be present 
(e.g., that it may have been removed or be congenitally absent) - this is 
achieved by omitting the third axiom. 

This allows inferences such as that a disease of a part must be a disease of 
the whole structure (s) node, but not of the whole taken as in its entirety (e) 
node. By careful selection of which of the three members of an SEP-triple 
is used in an assertion, it appears to be possible to be selective about which 
properties are “inherited”. For example: “diseases of parts are diseases of 
the whole”, but “surfaces of parts are not surfaces of the whole”. Hence 
in Hahn’s schema, “surface of” should always refer to an entity (e) node 
representing the entire object, whereas diseases should refer to the structure 
(s) node representing the complex of the entire object and all of its parts. 

Detailed comparison of the expressiveness of SEP-triples with Snomed- 
rt’s right identities and Galen’s specialisedBy axioms is not yet known. 
However, the scheme presents a number of advantages and is relatively easy 
to implement with existing classifier technology. 
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13.3.2.4 Construct not implemented in any major medical ontology 
Padgham and Lambrix [1994] point out a number of other potential patterns 
for relationships between parts and wholes of which at least one is potentially 
important for anatomical reasoning but not implemented in any current 
Description Logic. This formalizes the pattern that from “the hand is part 
of the arm” we may infer that “the skin of the hand is a part of the skin of 
the arm” . One way to capture the essence of this notion formally would be 
to allow axioms of the form RoSCSoRso that we have 

isLayerOf o isPartOf C isPartOf o isLayerOf, 

from which may be inferred, for example, 

3isLayerOf.(3isPartOf.Arm) C 3isPartOf.(3isLayerOf.Arm). 

The Galen ontology makes the necessary distinctions between different 
partonomic relations but the Grail language does not implement this in- 
ference. 


13.3.3 Other issues of expressivity 

Both Galen and Snomed-rt use Description Logics with a very limited 
range of core constructors - usually only existential quantification and con- 
junction. Both even exclude conjunctions of primitives. Neither uses univer- 
sal quantification in its constructors, although Grail’s “sanctioning” mech- 
anism provides constraints which serve some of the same functions [Rector 
et al, 1997]. (Hahn uses Loom, but exploits only a limited subset of the 
concept language.) On the other hand, both include constructs for transi- 
tive relations as described above. Two other issues deserve mention. 

13.3.3.1 Negation 

Neither Galen nor Snomed-rt uses negation, at least in the subset of 
the DL used in the ontology itself. This reflects real questions about 
the appropriate interpretation of negative statements in clinical records. 
In the context of medical records, there needs to be a clear differentia- 
tion at all levels between “false” and “not done” or “unknown”. Galen 
simulates some of the effects in the ontology by the use of “modalities” 
such as “presence/absence” and “done/no t-done” [Rector and Rogers, 2002; 
Rector et al, 2002], 
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13.3.3.2 General inclusion axioms 

Galen makes extensive use of a subset of general inclusion axioms - i.e. , 
axioms which state that one defined concept is classified under another con- 
cept. In Galen the subsuming term is restricted to be a conjunction of 
existentially qualified constructed concepts. Galen uses such expressions 
for two purposes: 

• To indicate which structures, states and processes are normal, abnormal but 
harmless, or pathological, i.e., to be treated as “diseases”. In many cases it is 
the presence of specific modifiers which implies that a structure or process is 
“pathological” . 

• To bridge levels of granularity and add implied meaning, e.g., to indicate that 
“ulcer of stomach” really occurs in the “lining of the stomach” or to cope with 
normalization as discussed in Subsection 13.4.2.2. 

Many Description Logics have explicitly disallowed general inclusion ax- 
ioms because of the difficulty of devising suitable algorithms and worries 
about intractability. However, motivated by Galen, Horrocks has shown 
effective optimizations for Description Logics including general inclusion ax- 
ioms. Furthermore, he has shown that all such axioms in Galen are of a 
particular form which can be transformed so as to be “absorbed” within 
term definitions, and therefore reasoned with relatively efficiently [Horrocks 
and Rector, 1996; Horrocks et al, 1996; Horrocks, 1997b; 1998b]. 


13.3.4 Frame-like behavior 

The use of Description Logics in both decision support and data entry sys- 
tems stemmed from the use of frame systems to manage default inheritance 
and identify the slots relevant to a particular object. Neither is easy to im- 
plement directly in Description Logics. Both are particularly important in 
medical applications. Because of their size and variability, exhaustive manual 
enumeration of cases is neither practical initially nor maintainable. 

13.3.4-1 Defaults and indexing 

A major function of an ontology in a decision support system is to index 
information. However, the natural representation for a domain expert of this 
indexing is usually in terms of generalizations with exceptions. For exam- 
ple drug indications, interactions and side effects are all almost invariably 
expressed as general principles plus exceptions (chemical structure, biochem- 
ical and physiological actions can usually be treated as being indefeasible). 
To require all statements to be indefeasible in the domain users’ environment 
drastically limits its usability and usefulness. 
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Galen’s approach is to attach “extrinsic” statements to the ontology 
and provide operations in the server which deliver all potential most specific 
candidates as described in Subsection 13.2.1.2. Experience has shown that if 
the ontology is well constructed, the incidence of conflict is small and almost 
always represents a real requirement for additional information. Often this 
information is application specific - how seriously a drug’s side effects should 
be viewed in a given situation, for example, or which of several minor variant 
codes matches the World Health Organization’s detailed coding criteria - 
and not appropriate to a re-usable ontology. 

It has been suggested that similar behavior could be achieved by “com- 
piling” all defaults at the user level to explicit exclusions in the underlying 
Description Logic. A practical demonstration of this approach on a large 
scale in the medical field has yet to be seen. 

13. 3. 4-2 Available “ slots “what is it reasonable to say?” 

Galen’s original approach was to represent “all and only what it is med- 
ically sensible to say”. Pen&Pad (as well as non-medical uses of Grail 
such as the Bioinformatics project Tambis [Baker et al, 1998]) depends 
on assembling data entry forms and queries dynamically. The total number 
of potential forms is vastly greater than could be enumerated individually. 
Both applications depend on being able to determine which roles are “sen- 
sibly” applicable to a particular concept. Grail’s sanctioning mechanism 
provides this information directly, but there is no direct way to form such 
a query within a standard DL framework. How best to address this issue 
remains a topic for research. 

A key part of the Galen experience in this regard is that only part 
of this “sanctioning” information is re-usable. In the original Pen&Pad 
application, changes to the user interface were made by changing the un- 
derlying ontology. In Galen, and in the commercial version of Pen&Pad, 
Clinergy™, changing the re-usable ontology to fit an application specific 
requirement was unacceptable, so an additional layer of “perspectives” was 
interposed between the ontology itself and applications. This layered archi- 
tecture now seems essential to many applications of ontologies which aspire 
to be re-usable. 


13.4 Ontological issues in medical ontologies 

13.4-1 Normative statements and abnormalities 

Congenital and other deformities present a major difficulty to clinical knowl- 
edge representations, because they require that statements which would 
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otherwise be absolute be made somehow contingent and that an extremely 
wide variety of statements be permitted in exceptional circumstances. They 
also require drawing distinctions that seem odd. Even in a thalidomide pa- 
tient with an absent left arm, we still need to be able to make statements 
about the left arm. Hence physical and potential presence must somehow be 
distinguished. 

Likewise, in determining what it is “sensible” to say, congenital anomalies 
make a nonsense of the usual constraints. For example, most patients have 
their heart on their left side, three lobes to their right lung, and two lobes to 
their left. Most patients have a “right middle lobe” but no “left middle lobe” 
of the lung. However, a small percentage of patients reverse the pattern. The 
anomaly is not always complete, so many combinations of abnormalities are 
possible. Doctors tend to be highly intolerant of being presented with options 
such as “left middle lobe” in normal circumstances. Unfortunately, they are 
equally intolerant of the inability to express the notion of a “left middle lobe” 
in that small number (<Cl%) of cases where it is needed. Taken individually, 
such anomalies are rare. Taken collectively, they are surprisingly common, 
i.e., a significant percentage of all patients are atypical in one respect or 
another. 


13.4- 2 Clinical pragmatics 

13. 4- 2.1 Conventional idioms 

As in any language, many terms or phrases have conventional meanings 
different from their literal interpretation. Such differences are not always 
immediately obvious. A typical example is “endocrine surgery” which it 
might seem natural to define as “surgery on an endocrine organ” . However, 
procedures on both the male and female reproductive organs are normally 
excluded, even though no doctor would dispute that they are endocrine 
organs. Similarly, “heart valve” might naively be defined as a “structure in 
the heart with valvular function”, but this includes numerous embryonic 
and sometimes congenitally deformed structures as well as the four “major 
valves” which serve the four “great vessels” entering and leaving the heart. 
Much of the effort of formulating a satisfactory medical ontology goes into 
reconciling such conventional usages with their apparent meaning. 


13. 4-2. 2 Normalization and implied information 

Many medical notions, particularly of actions and procedures, carry strong 
implications about their purpose. O’Neil’s classic example illustrates this 
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problem [O’Neil et al, 1995; Brown et al, 1998]. A common procedure to 
treat hip fractures is “insertion of pins in the femur”. The only reason to 
insert pins in the femur is to “fixate” a fracture, and the operation is ex- 
pected to be classified under both “insertion of pins” and “procedures to 
fixate fractures of long bones” . Should the ontology contain axioms to ex- 
tend the procedure definition automatically by adding “. . . to fixate fracture 
of femur”? If so, should the procedure be “fixation of fracture of femur by 
means of insertion of pins in the femur” or “insertion of pins in order to fixate 
fracture of femur”? Ordinarily such “qua-induced” duals are distinct - e.g., 
the “infection caused by a virus” is very different from the “virus caused by 
an infection”. In these cases, two or more logically distinct possible repre- 
sentations are clinically equivalent. Most systems cope with this situation by 
imposing external “guidelines” on domain expert authors to normalize such 
expressions to one form or the other, but the problem is far from solved. 


13.4-3 Semantic normalization and level of intent 

Consider the problem of what constitutes a “surgical procedure” . It is easy 
to agree that all surgical procedures are constituted by an “act” on some 
“thing” which either is, or is located in, an anatomical structure. It is less 
easy to agree on what constitutes an “act” when there is a hierarchy of mo- 
tivations: for example, “inserting pins to fixate a fracture of a long bone” or 
“destruction of a polyp by cautery” or “removal of a polyp (by excision)”. 
Furthermore, important classifications hang on notions of motivation such 
as “palliative surgery” versus “corrective surgery”. In addition, some sys- 
tems wish to be able to record operations just as “correction of X” without 
describing the exact “act” , while others wish to record “insertion of pins in 
fractured bone” without recording that the purpose is fixation. To address 
this problem within Galen, Rossi Mori et al. [1997] proposed a classification 
into four levels: 

L4 clinical goal (palliation, cure); 

L3 physiological goal: (correction, destruction, . . .); 

L2 primary surgical method (excision, insertion, lysis, . . . ); 

LI low level surgical act (cutting, cautery, . . . ). 

It is tempting to believe that a list of concepts in each category could 
be agreed, so that resolution could be done automatically. However, at least 
within the Galen project, intuitions and requirements clashed sufficiently to 
make this difficult. For example, “cautery” can sometimes be a low level act 
or sometimes a primary method. This ambiguity is dealt with in the formal 
ontology by having separate concepts for “simple cautery” and “removal by 
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cauterization”, and by care in formulating the intermediate representation 
(see Subsection 13.5.1). However, achieving consistent usage amongst a range 
of authors with different applications requires vigilance and careful quality 
assurance. 


13.5 Architectures: terminology servers, views, and change 

management 

13.5.1 Intermediate representations and views: Galen’s layered 

architecture 

There is an inevitable conflict between the need for an ontology to be re- 
usable and the requirement that it be easily understood by the domain ex- 
perts who must author and maintain it. Snomed-rt addresses this problem 
by keeping the ontology relatively simple. Galen addresses this problem 
by placing an “intermediate representation” and views (“perspectives”) be- 
tween the re-usable ontology and user-oriented applications [Rector et al, 
1999; 2001]. The intermediate representation and perspective layers in the 
architecture hide complexities irrelevant to the current application from do- 
main experts and other users. It also allows for variations amongst domain 
experts in the vocabulary, structure, and - critically for an international 
project - language. In this layered architecture, the DL ontology is effec- 
tively reduced to a role analogous to that of an assembly language program. 
Using an intermediate representation both allows loose coupling amongst 
authors and simplifies the authoring task. 

Within the Galen project, use of an intermediate representation reduced 
training time for new authors from months to days. It also drastically re- 
duced the time required centrally to harmonize the work of different authors 
so that the resulting classification would pass an agreed quality assurance. 
Prior to the introduction of the intermediate representation, central har- 
monization had consumed over fifty percent of the effort; following intro- 
duction of the intermediate representation this dropped to less than ten 
percent. This is a major saving given that the knowledge engineers required 
for central harmonization take a year or more to train fully. The experi- 
ence of developing the drug ontology in Prodigy (see Subsection 13.2.3) 
has been roughly comparable. In addition, in the drug ontology, the use of 
the intermediate representation has allowed the quality assurance experts to 
participate directly in correcting the authored ontology - something which 
would be entirely impractical in its expanded formulation in the Description 
Logic. 
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13.5.2 Learning vs. building 

Given the scale of medical ontologies, it would obviously be attractive to use 
learning techniques for at least some of their construction. Hahn et al. [1999a] 
are focusing on using language plus the structure of the Unified Medical 
Language System as a major source for inducing their ontology. Campbell 
et al. [1998] have outlined a strategy which makes use of lexical “suggestions” 
to guide manual modeling as part of the Snomed-rt methodology. Galen 
has experimented with various linguistic techniques but so far with limited 
success [Ceusters et al . , 1999]. 


13.5.3 Version and change management 

Any medical ontology for general use must be a living developing struc- 
ture. There are both clinical and technical issues to be dealt with. Campbell 
et al. [1996] have developed a tightly coupled methodology for change man- 
agement in conjunction with Snomed-rt, while Oliver et al. [1999] and 
Cirnino [1996] have discussed the issues of changes in medical vocabulary. 


13.6 Discussion: key lessons from medical ontologies 

Medicine is big and complicated. It has a long tradition of controlled vocab- 
ularies and coding systems. Developing re-usable medical ontologies presents 
at least three major classes of issue to the DL community: 

• Developing implementations which scale. 

• Developing architectures which reconcile the needs of users for simplicity with the 
formal constraints required for tractability and the ontological richness required 
for re-use. 

• Developing formalisms expressive enough to cope with constructs of particular 
concern to medicine, particularly part-whole relations but also other spatio- 
temporal constructs such as adjacency. 

Perhaps most critically, medicine poses the challenge of presenting DL 
notations in forms which users can use to meet real problems - whether 
in representation of medical records, indexing of information for decision 
support, or supporting user interfaces and natural language processing. 
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Abstract 

It has long been realized that the web could benefit from having its content 
understandable and available in a machine processable form. The Semantic 
Web aims to achieve this via annotations that use terms defined in ontologies 
to give well defined meaning to web accessible information and services. 
OWL, the ontology language recommended by the W3C for this purpose, 
was heavily influenced by Description Logic research. In this chapter we 
review briefly some early efforts that combine Description Logics and the 
web, including predecessors of OWL such as OIL and DAML+OIL. We 
then go on to describe OWL in some detail, including the various influences 
on its design, its relationship with RDFS, its syntax and semantics, and a 
range of tools and applications. 1 


14.1 Background and history 

The World Wide Web, while wildly successful in growth, may be viewed as 
being limited by its reliance on languages such as HTML that are focused 
on presentation (i.e. , text formatting) rather than content. Languages such 
as XML do add some support for capturing the meaning of web content 
(instead of simply how to render it in a browser), but more is needed in 
order to support intelligent applications that can better exploit the ever 
increasing range of information and services accessible via the web. Such 


1 This chapter provides an update to the chapter in the first edition of this book entitled “Digital 
Libraries and Web-Based Information Systems” by Ian Horrocks, Deborah L. McGuinness, 
and Christopher A. Welty. That chapter was written prior to the completion of OWL, and 
focused on earlier Description-Logic-based ontology languages for the Semantic Web. Some of 
the material in the current chapter has previously appeared in other forms in conference and 
journal publications, in particular in [Horrocks et al., 2003]. 
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applications are urgently needed in order to avoid overwhelming users with 
the sheer volume of information becoming available. 

The Semantic Web has been envisaged as an evolution of the existing web 
from a linked document repository into an application platform where “infor- 
mation is given well-defined meaning, better enabling computers and people 
to work in cooperation” [Berners-Lee et al, 2001]. This is to be achieved by 
augmenting the existing layout information with semantic annotations that 
add descriptive terms to web content, with the meaning of such terms being 
defined in ontologies. 

In order for the meaning of semantic annotations to be accessible to appli- 
cations (as well as humans), the ontology language being used should have a 
precisely defined semantics and should be amenable to automated process- 
ing. Description Logics appear to be ideally suited to this role: they have 
a formal, logic-based semantics, and are often equipped with decision pro- 
cedures that have been designed with the objective of being implemented 
in automated reasoning systems. This view of the potential place of De- 
scription Logics in the Semantic Web led to the development of a number of 
languages that brought Description Logic concepts to the Semantic Web, cul- 
minating in the development of the Web Ontology Language OWL. OWL is 
the World Wide Web Consortium (W3C) recommended ontology language 
for the Semantic Web, and exploits many of the strengths of Description 
Logics, including well defined semantics and practical reasoning techniques. 

In this chapter we first review briefly the history of Description Logic 
efforts related to the Semantic Web, in particular OIL and DAML+OIL. 
We then go on to describe OWL in some detail, and to show how it brings 
Description Logic concepts fully into the Semantic Web. 


14.1.1 Early uses of Description Logics in the Semantic Web 

Before the development of Description Logic-related languages designed for 
the Semantic Web, there were several systems that used Description Logics 
in the context of the web. We will describe some salient features of two 
systems, Untangle and FindUR, that illustrate early Description Logic 
usage on the web. 

The relationship between hypertext and semantic networks has long been 
realized, but one of the earliest Description Logic systems to realize this re- 
lationship was the Untangle system [Welty and Jenkins, 2000], a Descrip- 
tion Logic system for representing bibliographic (card-catalog) information. 
The Untangle project began as a bit of exploratory research in using De- 
scription Logics for digital libraries [Welty, 1994], but out of sheer temporal 
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coincidence with the rise of the web, a web interface was added and the first 
web-based Description Logic system was born. 

The original Untangle web interface was developed in 1994 [Welty, 
1996a], and combined Lisp-Classic and the CommonLisp Hypermedia 
Server (CL-HTTP) [Mallery, 1994] to implement a hypertext view of the 
ABox and TBox semantic networks, and used nested bullet lists to view the 
concept taxonomy, with in-page cross references for concepts having mul- 
tiple parents. The interface was interesting in some respects as a tool to 
visualize Description Logic and semantic network information, though this 
aspect was never fully developed. 

As the World Wide Web (WWW) became the primary means of dissem- 
ination of Computer Science research, the goals of the Untangle project 
shifted in 1995 to cataloging and classifying pages on the web [Welty, 1996b], 
which was viewed as a massive and unstructured digital library [Welty, 1998] . 

Another early project using Description Logics for the web was the 
FindUR system at AT&T [McGuinness, 1998; McGuinness et al., 1997]. 
The basic notion of FindUR was query expansion , 2 that is, taking syn- 
onyms or hyponyms (more specific terms) and including them in the input 
terms, thereby expanding the query. 

The FindUR system represented a simple background knowledge base 
containing mostly thesaurus information built in a Description Logic (Clas- 
sic) using the most basic notions of Wordnet (synsets and hyper /hyponyms) 
[Miller, 1995]. Concepts corresponding to sets of synonyms (synsets) were 
arranged in a taxonomy. These synsets also contained an informal list of 
related terms. Site-specific search engines (built on Verity - a commercial 
search engine) were hooked up to the knowledge base. Any search term 
would first be checked in the knowledge base, and if it was contained in any 
synset, a new query would be constructed consisting of the disjunction of all 
the synonymous terms, as well as all the more specific terms (hyponyms). 

The background knowledge was represented in Classic, but the Descrip- 
tion Logic was not itself part of the on-line system. Instead, the information 
used by the search engine was statically generated on a regular basis and used 
to populate the search engine. The true power of using a Description Logic as 
the substrate for the knowledge base was realized mainly in the maintenance 
task. The Description Logic allowed the maintainer of the knowledge base 
to maintain some amount of consistency, such as discovering cycles in the 

2 Work on a subsequent Description-Logic-based approach to query expansion addressed some 
formal issues in evaluating the soundness and completeness of alternative approaches [Rousset, 
1999b], Other work on Description Logic (or Description Logic-inspired) approaches to retrieval 
also exists, e.g., [Meghini et al., 1997; Calvanese et al., 1999c], 
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taxonomy and disjoint synsets. These simple constraints proved effective 
tools for maintaining the knowledge since the knowledge itself was very sim- 
ple. 

Additional use of the Description Logic approach in FindUR was real- 
ized in applications that exposed more structured searches, exploiting sub- 
class hierarchies and property relationships, such as the medical applications 
of FindUR in the P-CHIP Primary Care Search Application [McGuinness, 
1999]. This type of structured search, exploiting background ontologies and 
relationships between terms, can also be seen in later work, e.g., in the SHOE 
project [Heflin et al, 2003]. 

14.2 Steps towards integration with the Semantic Web: OIL 

and DAML+OIL 

The first major effort to build a language that combined Description Logics 
and the Semantic Web was OIL (the Ontology Inference Layer) [Horrocks 
et al, 2000a], a part of the On-To-Knowledge research project funded by 
the European Union. The OIL language was explicitly designed as “a web- 
based representation and inference language for ontologies [combining] the 
widely used modeling primitives from frame-based languages with the formal 
semantics and reasoning services provided by description logics” (http : / / 
www. ontoknowledge . org/oil/oilhome . shtml). 

Description Logics provide the semantics for OIL, so much so that the 
semantics of OIL is specified via a mapping to the Description Logic STilQ 
[Fensel et al. , 2001; Horrocks et al. , 1999]. OIL has a syntax based on the 
Resource Description Framework (RDF), as well as an XML syntax, that 
provided the connection to the Semantic Web of the time. 3 OIL allows the 
grouping of Description Logic constructs in a way similar to frame systems, 
providing a more intuitive feel to the language as opposed to the logically 
inspired syntax usually used for Description Logics. These three influences - 
Description Logics, frames, and the Semantic Web - are present not only in 
OIL, but also in all of its successors. 4 

14.2.1 SH 1 Q and SHOIM 

In this subsection we will briefly introduce the syntax and semantics of the 
Description Logics on which OIL, DAML+OIL and OWL are based, i.e. , 

3 At the time that OIL was developed, RDF- the base language of the Semantic Web - was 
without a fully specified semantic foundation. 

4 The correspondence between Description Logics and frame systems is also discussed in Sub- 
section 4.1.2. 
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Construct Name 

Syntax 

Semantics 


atomic concept 
atomic role 
transitive role 
conjunction 
disjunction 
negation 
exists restriction 
value restriction 

A 

R 

R G R+ 
CnD 
CUD 
C 

3R.C 

MR.C 

A 1 C A x 
R 1 C A 1 x A 1 
R 1 = (R x ) + 

c I nD I 

C I UD I 
A I \C I 

{x 3y.(x,y) £ R 1 and y £ C 1 } 
{x \/y.(x,y) £ R 1 implies y £ C 1 } 

S 

role hierarchy 

R U S 

R x C S x 

n 

nominal 

{«} 


O 

inverse role 

R~ 

{( x,y ) | (■ y,x ) £ R 1 } 

1 

number 

restrictions 

^ nP 
^ nP 

{x | Jt {y-(x,y) £ P x } ^ n} 
{x | Jt {y-(x,y) £ P 1 } ^ n} 

M 

qualifying number 
restrictions 

> n P.C 
^ nP.C 

{x | (]{y.(x, y) £ P x and y £ C 1 } ^ n} 
{x | )J{y.(x, y) £ P x and y £ C 1 } ^ n} 

Q 


Fig. 14.1. Syntax and semantics of the S family of Description Logics 

STilQ and SHOTJ\f. These logics are based on an extension of the well- 
known DL ACC [Schmidt-Schaufi and Srnolka, 1991] to include transitively 
closed primitive roles [Sattler, 1996]; this logic has been called S due to 
its relationship with the propositional (multi-) modal logic S4 m [Schild, 
1991]. 5 This logic is then extended to include features such as role inclusion 
axioms (74), nominals ( O ), inverse roles (X) and (possibly qualified) number 
restrictions (Q if qualified, M otherwise). 

The syntax and semantics of these features are summarised in Figure 14.1, 
where A is a concept name, C and D are concepts, R and S are roles, R + is 
the set of transitive roles, o is an individual name, P is a simple role (i.e. , one 
that is not transitive and has no transitive subrole) , and n is a nonnegative 
integer. Further details can be found in Chapter 2, and in [Horrocks et al, 
1999] and [Horrocks and Sattler, 2005]. 

These logics can also be extended with a simple form of concrete domains 
known as datatypes; this is denoted by appending (D) to the name of the 
logic, e.g., SPLOlM (T>). Concrete domains are discussed in detail in Chap- 
ter 6, and the datatype variant is described in [Horrocks and Sattler, 2001]. 

5 This logic has previously been called ACC R +, but this becomes too cumbersome when adding 
letters to represent additional features. 
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14.2.2 OIL 

The OIL language is designed to combine frame-like modeling primitives 
with the increased (in some respects) expressive power, formal rigor and au- 
tomated reasoning services of an expressive Description Logic [Fensel et al, 
2000]. OIL also comes “web enabled” by having both XML- and RDFS- 
based serializations (as well as a formally specified “human readable” form, 
which we will use here). In frame languages, classes (concepts) are described 
by frames, whose main components are a list of superclasses and a list of 
slot-filler pairs. A slot corresponds to a role in a Description Logic, and 
a slot-filler pair corresponds to either a value restriction (a concept of the 
form VI?. C) or an existential quantification (a concept of the form 3R.C) - 
one of the criticisms leveled at frame languages is that they are often unclear 
as to exactly which of these is intended by a slot-filler pair. 

OIL extends this basic frame syntax so that it can capture the full power 
of an expressive Description Logic. 

In order to allow users to choose the expressive power appropriate to 
their application, and to allow for future extensions, a layered family of 
OIL languages was described. The base layer, called “Core OIL” [Bechhofer 
et al, 2000], is a cut-down version of the language that closely corresponds 
with RDFS (i.e., it includes only class and slot inclusion axioms, and slot 
range and domain constraints). The standard language is called “Standard 
OIL”, and when extended with ABox axioms (i.e., the ability to assert that 
individuals and tuples are, respectively, instances of classes and slots), it 
is called “Instance OIL”. Finally, “Heavy OIL” was the name given to a 
further layer that was to include still unspecified language extensions. 

Figure 14.2 illustrates an OIL ontology (using the human readable seri- 
alization) corresponding to an example terminology from Chapter 2. A full 
specification of OIL, including DTDs for the XML and RDFS serializations, 
can be found in [Horrocks et al., 2000a]. 

Standard OIL can be seen as a syntactic variant of the Description Logic 
STilQ [Horrocks et al., 1999] extended with simple concrete datatypes 
[Baader and Hanschke, 1991a; Horrocks and Sattler, 2001], otherwise known 
as SLCIQlfD). Rather than providing the usual model-theoretic semantics, 
OIL defines a translation that maps an OIL ontology into an equivalent 
SLCTQlfD) terminology. From this mapping, OIL derives both a clear seman- 
tics and a means to exploit the reasoning services of Description Logic sys- 
tems such as FaCT [Horrocks, 1998b], Racer [Haarslev and Moller, 2001e] 
and Pellet [Pellet, 2003] that implement (most of) SH1Q(V). 
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name “Family” 

documentation “Example ontology describing family relationships” 
definitions 

slot-def hasChild 
inverse isChildOf 
class-def defined Woman 
subclass-of Person Female 

class-def defined Man 

subclass-of Person not Woman 

class-def defined Mother 
subclass-of Woman 
slot-constraint hasChild 
has-value Person 
class-def defined Father 
subclass-of Man 
slot-constraint hasChild 
has-value Person 
class-def defined Parent 

subclass-of or Father Mother 

class-def defined Grandmother 
subclass-of Mother 
slot-constraint hasChild 
has-value Parent 

class-def defined MotherWithManyChildren 
subclass-of Mother 
slot-constraint hasChild 
min-cardinality 3 

class-def defined MotherWithoutDaughter 
subclass-of Mother 
slot-constraint hasChild 
value-type not Woman 


Fig. 14.2. OIL “family” ontology. 
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14-2.3 The DAML project and DAML+OIL 

At about the same time as OIL was being developed, the DARPA Agent 
Markup Language (DAML) program was started in the United States. 
DAML was initiated in order to provide the foundation for the next gener- 
ation of the web which, it was anticipated, would increasingly utilize agents 
and programs rather than relying so heavily on human interpretation of web 
information [Hendler and McGuinness, 2000]. 

One of the early widely distributed contributions of the DAML program 
was DAML-ONT- a proposal for an ontology language for the web [Hendler 
and McGuinness, 2000; McGuinness et al, 2002], This language began with 
the requirement to build on the best practice in web languages of the time, 
and in particular to extend W3C’s Resource Description Framework, with 
the aim of adding expressive power suited to agent and service interopera- 
tion. 

It became obvious that the goals of DAML-ONT and OIL were so similar 
that these objectives could best be served by combining the two efforts. The 
resulting language, DAML+OIL, has a formal (model-theoretic) semantics 
that provides machine and human understandability [van Harmelen et al., 
2001], an axiomatization [Fikes and McGuinness, 2001], and a reconciliation 
of the language constructors from the two precursor languages. 

DAML+OIL is similar to OIL in many respects, but is more tightly in- 
tegrated with RDFS, which provides the only specification of the language 
and its only serialization. While the dependence on RDFS has some ad- 
vantages in terms of the re-use of existing RDFS infrastructure and the 
portability of DAML+OIL ontologies, using RDFS to completely define 
the syntax of DAML+OIL is quite difficult as, unlike XML, RDFS is not 
designed for the precise specification of syntactic structure. For example, 
there is no way in RDFS to state that a restriction (slot constraint) should 
consist of exactly one property (slot) and one class, and as a result the fol- 
lowing axiom is perfectly legal in DAML+OIL, in spite of the fact that the 
restriction specifies two properties and two classes: 

<daml: Class rdf : ID="Person"> 

<rdf s : subClassOf > 

<daml : Restriction 

<daml : onProperty rdf : resource="#hasFather"/> 
<daml : onProperty rdf : resource="#hasMother"/> 
<daml :toClass rdf : resource="#Man"/> 

<daml :toClass rdf : resource="#Woman"/> 

</daml : Restriction 
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</rdf s : subClassOf > 

</daml : Class> 

The solution to this problem adopted by DAML+OIL is to define the 
semantics of the language in such a way that it gives a meaning to any (parts 
of) ontologies that conform to the RDFS specification, including “strange” 
constructs such as the one shown above. This is made easier by the fact that, 
unlike OIL, the semantics of DAML+OIL is directly defined, although a 
translation into a suitable DL would also have been possible: the above 
DAML+OIL axiom is, for example, equivalent to the following DL axioms: 

Person C EhasFather.Man 
Person C dhasFather. Woman 
Person C ElhasMother.Man 
Person C EhasMother. Woman 
ElhasFather.Man = ElhasFather. Woman 
ElhasFather. Woman = ElhasMother.Man 
EhasMother.Man = EhasMother. Woman. 

This may seem strange, but has the advantage of giving a consistent and 
unambiguous semantics to DAML+OIL restrictions consisting of multiple 
properties and/or classes. The DAML+OIL specification strongly recom- 
mends, however, that the use of such restrictions (and other “strange” con- 
structs) be avoided. 

Another effect of DAML+OIL’s tight integration with RDFS is that 
the frame structure of OIL’s syntax is much less evident: a DAML+OIL 
ontology is more Description Logic-like in that it consists largely of a rel- 
atively unstructured collection of subsumption and equality axioms. This 
can make it more difficult to use DAML+OIL with frame-based tools such 
as Protege [Grosso et al, 1999] or OilEd [Bechhofer et al, 2001b], be- 
cause the axioms may be susceptible to many different frame-like groupings 
[Bechhofer et al. , 2001a]. 

From the point of view of language constructs, the differences between 
OIL and DAML+OIL are relatively trivial. Although there is some dif- 
ference in “keyword” vocabulary, there is usually a one-to-one mapping of 
constructors, and in the cases where the constructors are not completely 
equivalent, simple translations are possible. 

The initial release of DAML+OIL did not include any specification of 
datatypes. The language was, however, subsequently extended with arbi- 
trary datatypes from the XML Schema type system, which can be used in 
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restrictions (slot constraints) and range constraints. As in STLOQifD) [Hor- 
rocks and Sattler, 2001], a clean separation is maintained between instances 
of “object” classes (defined using the ontology language) and instances of 
datatypes (defined using the XML Schema type system). In particular, it is 
assumed that the domain of interpretation of object classes is disjoint from 
the domain of interpretation of datatypes, so that an instance of an object 
class (e.g., the individual Italy) can never have the same interpretation as a 
value of a datatype (e.g., the integer 5), and that the set of object properties 
(which relate individuals to individuals) is disjoint from the set of datatype 
properties (which relate individuals to datatype values). 


14.3 Full integration into the Semantic Web: OWL 

With its tighter integration with RDF and RDFS, DAML+OIL was the 
first Description Logic-inspired language to be completely integrated into the 
fabric of the Semantic Web (as it was then defined). There was, however, 
no formal semantic integration of DAML+OIL with RDF and RDFS, 
as RDF and RDFS did not have a formal semantics at the time when 
DAML+OIL was being developed. DAML+OIL also lacked any formal 
status, having been the product of a group of researchers operating under 
the auspices of the so-called “Joint EU/US ad hoc Agent Markup Language 
Committee”. This group published DAML+OIL as a W3C Note on De- 
cember 18, 2001 (see http://www.w3.org/Submission/2001/12/). 

To fix these problems, the W3C chartered two working groups in 2001. 
The RDF Core Working Group was to be responsible for updating the RDF 
recommendation and providing a formal semantics for RDF and thus for the 
Semantic Web. The Web Ontology Working Group was to be responsible for 
designing an ontology language for the web, compatible with the new version 
of RDF. The Web Ontology Working Group developed the Web Ontology 
Language OWL [Bechhofer et al., 2004], which became a W3C recommenda- 
tion on 10 February 2004 (see http://www.w3.org/2004/0WL/). In parallel, 
the RDF Core Working Group developed a formal semantics for RDF and 
RDFS [Hayes, 2004], 

The OWL recommendation defines a family of three languages in order 
to address concerns ranging from level of expressiveness to degree of com- 
patibility with RDF. Two of these so-called “species” of OWL, OWL Full 
and OWL DL, provide the same set of Description Logic constructors, but 
differ in the way the constructors can be used: in OWL Full they can be 
used in arbitrary RDF graphs, whereas in OWL DL their use is restricted 
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so that OWL DL ontologies correspond directly to Description Logic knowl- 
edge bases. The third species of OWL, OWL Lite, is very similar to OWL 
DL, but provides a reduced set of constructors. 


14-3.1 Influences on the Design of OWL 

The design of OWL was subject to a variety of influences. These in- 
cluded influences from established formalisms and knowledge representation 
paradigms, influences from existing ontology languages, and influences from 
existing Semantic Web languages. 

Some of the most important influences on the design of OWL came, 
via its predecessor DAML+OIL, from Description Logics, from the frames 
paradigm, and from RDF. In particular, the formal specification of the lan- 
guage was influenced by Description Logics, the surface structure of the 
language (as seen in the abstract syntax) was influenced by the frames 
paradigm, and the RDF /XML exchange syntax was influenced by a re- 
quirement for upwards compatibility with RDF. 

Description Logics, and insights from Description Logic research, had a 
strong influence on the design of OWL, particularly on the formalization of 
the semantics, the choice of language constructors, and the integration of 
datatypes and data values (see, e.g., Chapters 2-6). In fact OWL DL and 
OWL Lite (two of the three species of OWL) can be viewed as expressive 
Description Logics, with an ontology being equivalent to a Description Logic 
knowledge base. More precisely, OWL DL and OWL Lite are equivalent to 
D) and SHOlN (D) respectively (see Subsections 14.3.3 and 14.3.5 
for more details). 

This design was motivated by practical considerations. The designers of 
OWL wanted to have some idea as to how difficult it would be for tools 
and applications to support the language. It was therefore important to 
understand its formal properties, e.g., with respect to the decidability and 
complexity of key inference problems. These properties followed directly 
from the correspondences with Description Logics (see Chapter 3). These 
correspondences would allow tools and applications to exploit known rea- 
soning algorithms and even (highly optimized) implementations (see, e.g., 
Chapters 7-9). 

In the Semantic Web context, where users with a wide range of expertise 
might be expected to create or modify ontologies, readability and general 
ease of use are important considerations for an ontology language. In the 
design of OIL [Fensel et al., 2001], these requirements were addressed by 
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providing a surface syntax based on the frames paradigm. Frames group 
together information about each class, making ontologies easier to read and 
understand, particularly for users not familiar with (Description) Logics. 
The frames paradigm has been used in a number of well-known knowledge 
representation and ontology environment systems including the Protege 
ontology design tool [Grosso et al ,, 1999], the Ontolingua ontology environ- 
ment tool [Farquhar et al, 1996], the OKBC knowledge model [Chaudhri 
et al, 1998b], and the Chimaera Ontology Evolution Environment [McGuin- 
ness et al, 2000b]. The design of OIL was influenced by XOL [Karp et al, 
1999] - a proposal for an XML syntax for OKBC Lite (a cut-down version 
of the OKBC knowledge model). 

In frame-based languages, each class is described by a frame. The frame 
includes the name of the class, identifies the more general class (or classes) 
that it specializes, and lists a set of “slots” . A slot may consist of a property- 
value pair, or a constraint on the values that can act as slot “fillers” (in 
this context, value means either an individual or a data value). This struc- 
ture was used in the OIL language, with some enrichment of the syntax 
for specifying classes and slot constraints so as to enable the full power 
of a Description Logic-style language to be captured. In addition, prop- 
erty frames were used to describe properties, e.g., specifying more general 
properties, range and domain constraints, transitivity and inverse property 
relationships. 

A class frame is semantically equivalent to a Description Logic axiom as- 
serting that the class being described by the frame is a subclass of each of 
the classes that it specializes and of each of the property restrictions corre- 
sponding to the slots. As well as a richer slot syntax, OIL also offered the 
possibility of asserting that the class being described by the frame was ex- 
actly equivalent to the relevant intersection class (i.e. , that they were mutu- 
ally subsuming). A property frame is equivalent to a set of axioms asserting 
the relevant subproperty relationships, range and domain constraints etc. 
OIL was designed so that OIL frames could easily be mapped to equivalent 
axioms in the S7iOQ(T>) Description Logic [Decker et al, 2000]. 

The formal specification and semantics of OWL are given by an abstract 
syntax [Patel-Schneider et al, 2004] that has been heavily influenced by 
frames in general and by the design of OIL in particular. In the abstract 
syntax, axioms are compound constructions that are very like an OIL-style 
frame. For classes, they consist of the name of the class being described, a 
modality of “partial” or “complete” (indicating that the axiom is asserting a 
subclass or equivalence relationship respectively) , and a sequence of property 
restrictions and names of more general classes. Similarly, a property axiom 
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specifies the name of the property and its various features. 

The frame style of the abstract syntax, which borrows heavily in spirit 
from the human readable syntax for OIL, makes it much easier to read 
(compared to the RDF/XML syntax), and also easier (for non- logicians) 
to understand and to use. Moreover, abstract syntax axioms have a direct 
correspondence with Description Logic axioms, and they can also be mapped 
to a set of RDF triples (as we will see in the following subsection). 

The third major influence on the design of OWL was the requirement to 
maintain the maximum upwards compatibility with existing web languages, 
and in particular with RDF [Manola and Miller, 2004], On the face of it 
this requirement made good sense as RDF (and in particular RDF Schema) 
already included several of the basic features of a class- and property-based 
ontology language, e.g., it allows subclass and subproperty relationships to 
be asserted. Moreover, the development of RDF preceded that of OWL, 
and it seemed reasonable to try to appeal to any user community already 
established by RDF. 

It may seem easy to meet this requirement simply by giving OWL an 
RDF-based syntax, but, in order to provide maximum upwards compati- 
bility, it was also thought necessary to ensure that the semantics of OWL 
ontologies was consistent with the semantics of RDF. This proved to be 
difficult, however, given the greatly increased expressive power provided by 
OWL. This will be discussed in more detail in Subsection 14.3.2. 


14-3.2 Layering (on RDFS) and three “species” of OWL 

On the face of it, maintaining maximum upwards compatibility with RDF 
made good sense, but this requirement led to a number of problems in the 
design of OWL. 

In the first place, RDF /XML is extremely verbose. Compare, for example, 
information about a class as it would be given in a Description Logic syntax 

Student = Person n ^ 1 enrolledln 

(a Student is a Person who is enrolledln at least 1 thing), or the OWL 
abstract syntax 

Class(Student partial Person 

restriction(enrolled!n minCardinality(l))) 


with how it would most naturally be written using the OWL RDF /XML 
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syntax 6 

<owl: Class rdf : ID=" Student "> 

<owl : intersectionOf rdf : parsetype="Collection"> 

<owl: Class rdf s : about="Person" /> 

<owl : Restriction 

<owl : onProperty rdf : resource="enrolledIn" /> 

Cowl :minCardinality rdf s : datatype="&xsd; Integer "> 

1 

</ owl : minCardinality> 

</ owl : Restriction 
c/owl : intersectionOf > 
c/owl : Class> 

This verbosity of OWL’s RDF /XML syntax may not, in itself, be a se- 
rious problem given the capabilities and bandwidths of modern computers 
and communication systems, and that this syntax is mainly intended for 
exchanging data between OWL applications. The RDF /XML syntax does, 
however, lead to some more serious problems. RDF is itself a graph-based 
formalism, with graphs-expressed as sets of subject-predicate-object triples, 
where each triple represents a labeled edge (the predicate) connecting two 
vertices (the subject and object). This means that many OWL constructs, 
such as property restrictions, have to be encoded as several triples. There is 
no requirement that these triples occur together, so parsing becomes difficult 
as it may be necessary to scan the entire input in order to locate all of the 
components of a given construction. Moreover, circular and other unusual 
structures with ill defined meanings cannot be ruled out. 

Even more problematical is the relationship between the semantics of an 
OWL ontology and the semantics of the RDF triples used to encode it. This 
was not as much of an issue when OIL and DAML+OIL were designed, as at 
that time the meaning of RDF was not precisely specified. OIL in particular 
did not bother to relate the RDF meaning of its RDF /XML syntax to the 
OIL meaning of this syntax - the RDF /XML syntax for some OIL constructs 
does more or less line up with the RDF meaning of these constructs, but 
this is by no means the case for all such constructs. 

DAML+OIL did a better job of abiding by the RDF meaning of its syn- 
tax. The DAML+OIL model theory [van Harmelen et al . , 2001] included 
a semantic condition for triples that was close to the RDF meaning (as 
defined at that time) for triples. Moreover, DAML+OIL used the built- 

6 Full details oil the OWL RDF /XML syntax can be found in the OWL reference document 

[Bechhofer et al., 2004], 
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in RDF and RDFS vocabulary to a greater extent than did OIL, and 
used it in a way generally compatible with the RDF or RDFS mean- 
ing. For example DAML+OIL uses rdf s : subClassOf to relate classes to 
superclasses. 

By the time OWL was being designed, RDF was being given a formal 
meaning via the RDF model theory [Hayes, 2004], It proved to be difficult 
to design an RDF syntax for OWL such that the Description Logic seman- 
tics assigned by OWL to its various constructs was compatible with the 
semantics of the RDF triples used to encode them. Moreover, there was an 
incompatibility between the requirement to be fully backwards compatible 
with OWL (i.e. , to allow arbitrary RDF graphs to be interpreted as OWL 
ontologies), and the requirement for OWL to be equivalent to an expressive 
Description Logic. (See [Horrocks et al . , 2003] for a detailed discussion of 
these issues.) 

These problems were eventually “resolved” by defining two “species” of 
OWL: OWL Full and OWL DL. Only a subset of RDF graphs correspond 
to OWL DL ontologies. In particular, the graph cannot include cyclical and 
other problematical constructions (including “malformed” OWL syntax), 
and the sets of class, property, and individual names must be disjoint. For 
RDF graphs that satisfy these syntactic restrictions, a fairly standard De- 
scription Logic-style model theory is used to define the semantics of the 
resulting OWL constructs. In contrast, OWL Full uses an RDF-style model 
theory to give a semantic account of the use of OWL constructors in arbi- 
trary RDF graphs. (See Subsections 14.3.4 and 14.3.5 for more details.) 

As well as arguments about the importance of compatibility with RDF, 
there were also tensions within the working group regarding the expressive 
power that was appropriate for OWL, with some members arguing that 
OWL DL was too complex to be understood by new users or implemented 
by application developers. Weight was lent to these arguments by the fact 
that, at the time, no tableaux decision procedure was known for SPLOIN (D) 
(the Description Logic language underlying OWL DL), and no implemented 
system supported it. In response to these concerns, a third OWL species 
was defined, namely OWL Lite. 

OWL Lite is a syntactic subset of OWL DL that prohibits and/or restricts 
the use of certain constructors and axioms with the aim of making the 
language easier to understand and/or implement. It has subsequently been 
shown, however, that by combining other constructors and axioms, most of 
the expressive power of OWL DL can be regained, and that OWL Lite is 
expressively equivalent to SPLIT (T)) [Horrocks and Patel-Schneider, 2003]. 
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Given that OWL Full does not correspond either syntactically or seman- 
tically to a Description Logic, in the remainder of this chapter we will focus 
our attention on OWL DL and OWL Lite. 


14-3.3 OWL DL abstract syntax and semantics 

OWL DL has some differences from standard Description Logics. These 
differences provide a bridge between the formal Description Logic world and 
the Semantic Web world. 

• OWL uses URI references as names, and constructs these URI references in the 
same manner as that used by RDF. It is thus common in OWL to use qualified 
names as shorthand for URI references, using, for example, the qualified name 
owl:Thing for the URI reference http://www.w3.Org/2002/07/owl#Thing. 

• OWL gathers information into ontologies, which are generally stored as Web 
documents written in RDF /XML. Ontologies can import other ontologies, adding 
the information from the imported ontology to the current ontology. 

• OWL allows RDF annotation properties to be used to attach information to 
classes, properties, and ontologies, such as owl :DeprecatedClass. These annota- 
tions are RDF triples, and are therefore required to carry a full semantic weight in 
RDF. In OWL DL, however, such annotations carry a separate, limited meaning. 

• OWL uses the facilities of RDF datatypes and XML Schema datatypes to pro- 
vide datatypes and data values (a very restricted form of concrete domains; see 
Subsection 6.2.1). 

• OWL DL and OWL Lite have a frame-like abstract syntax, whereas RDF /XML 
is the official exchange syntax for all three species of OWL. 

As mentioned above, OWL DL is very closely related to SHOIAf (D) , 
which extends STiOLQ [Horrocks and Sattler, 2005] with datatypes like 
those in S7iOQ{V) [Horrocks and Sattler, 2001], but allows only unqual- 
ified number restrictions (see Subsection 2.4.2). OWL DL can form de- 
scriptions of classes, datatypes, individuals and data values using the con- 
structs shown in Figure 14.3. In this table the first column gives the OWL 
abstract syntax for the construction, while the second column gives the 
equivalent Description Logic syntax. The letters A, D, R, U, o, and v 
represent, respectively, names for classes (concepts), data ranges, object 
properties (abstract roles), datatype properties (concrete roles), individu- 
als (nominals), and data values; C , possibly subscripted, represents an ar- 
bitrary class description. In OWL, data values are RDF literals (i.e., in- 
stances of datatypes such as string or integer), and all other names are 
URI references. As mentioned above, owl: Thing and owl: Nothing are 
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Abstract Syntax 

DL Syntax 

Descriptions ( C ) 


A 

A 

owl : Thing 

T 

owl : Nothing 

_L 

intersectionOf (C\ . . . C n ) 

Ci n • • • n c n 

unionOf (C \ . . . C n ) 

Cl U • • • U C n 

complementOf ( C ) 

—<c 

oneOf (oi . . . On) 

{ 01 } U • • • U {o n } 

restriction!/? someValuesFrom(C) ) 

3R.C 

restriction!/? allValuesFromCC) ) 

MR.C 

restriction!/? hasValue(o)) 

R : o 

restriction(i? minCardinality (n) ) 

^ nR 

restriction(i? maxCardinality in) ) 

^ nR 

restriction^/ someValuesFrom(D) ) 

3U.D 

restriction(C7 allValuesFrom(D) ) 

VU.D 

restriction^/ hasValue(u)) 

U : v 

restriction (JJ minCardinality (n) ) 

^ nU 

restriction (JJ maxCardinality (n) ) 

^ nU 

Data Ranges (D) 


D 

D 

oneOf (ui . . . v n ) 

{n} u...u {v n } 

Object Properties (R) 


R 

R 

in v(i?) 

R~ 

Datatype Properties ( U ) 


U 

U 

Individuals (o) 


o 

o 

Data Values (v) 


V 

V 


Fig. 14.3. OWL DL descriptions, data ranges, properties, individuals, and data 
values 


shorthand for the URI references http : //www . w3 . org/2002/07/owl#Thing 
and http://www.w3.Org/2002/07/owl#Nothing respectively. 

The treatment of concrete values in OWL is somewhat different from 
the usual treatment of concrete values in Description Logics [Baader 




OWL: a Description-Logic-Based Ontology Language for the Semantic Web 475 

and Hanschke, 1991a; Lutz, 2002], and is instead based on the more re- 
stricted concrete datatypes introduced in [Horrocks and Sattler, 2001]. 
OWL uses datatypes from XML Schema datatypes as its concrete types, 
so xsd: integer is a concrete type in OWL, namely the type of integers. 
OWL uses the RDF syntax for these concrete values, so "2"'' "xsd: integer 
is the way to write the integer 2. OWL also allows plain RDF lit- 
erals, which are a combination of a string and an optional language 
tag. These plain literals belong to the datatype rdf s : Literal. OWL 
also allows sets of data values to be used in concept expressions, as in 
oneOf (" l"~"xsd: integer "2 """xsd: integer "3" ""xsd: integer) . 

OWL uses these description-forming constructs in axioms that provide in- 
formation about classes, properties, and individuals, as shown in Figure 14.4. 
Again, the frame-like abstract syntax is given in the first column, and the 
standard Description Logic syntax is given in the second column. The nota- 
tion is the same as in Figure 14.3. 

Either partial or complete information can be stated about a class, as in 

Class (ex: Country partial owl: Thing) 

Class (ex: Person partial owl: Thing) 

Class (ex: Student partial ex: Person) 

Class (ex: Canadian complete ex: Person 

hasValue (ex : nationality ex : Canada) ) 

which makes ex: Country and ex: Person classes, ex: Student a sub- 
class of ex: Person, and ex: Canadian precisely those people who have 
ex nationality ex : Canada . 1 

In OWL DL properties are divided into object properties, such as 
ex nationality, which relate individuals to other individuals, datatype 
properties, such as ex: age, which relate individuals to data values, and 
annotation properties, which can be used to add uninterpreted information 
(such as versioning information) to individuals, classes, and properties. Con- 
straints, such as domains and ranges, can be given for object properties and 
datatype properties, but not for annotation properties. 

DatatypeProperty (ex : age domain (ex : Person) 

range (xsd : integer) ) 

□bjectProperty (ex nationality domain (ex : Person) 

range (ex : Country) ) 


7 For a more extensive example of how to use OWL, see the OWL guide [Smith et al., 2004], 
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Abstract Syntax 

DL Syntax 

Class (A partial C\...C n ) 

A c Ci n • • • n C n 

Class (A complete C \ . . . C n ) 

A = Ci n • • • n C n 

EnumeratedClass (A o\ . . . o n ) 

A = {01} U • • • U {o n } 

SubClassOf (Ci C 2 ) 

Ci CC 2 

EquivalentClasses (C*i . . . C n ) 

Cl = • • • = C n 

DisjointClasses(Ci . . . C n ) 

CCCjEl,!/] 

Datatype ( D ) 


ObjectProperty(i? super (i?i) . . . super (R n ) 

R C Ri 

domain(Ci) . . . dornain(C m ) 

^1 RGCi 

range (Ci) . . . range (Cg) 

T C VR.Ci 

[inverseOf (Ro)] 

R = Rq 

[Symmetric] 

R = R~ 

[Functional] 

TC (1 R 

[ Inver seFunctional] 

TEC R~ 

[Transitive] ) 

Tr(R ) 

SubPropertyOf (i?i R 2 ) 

R\ E R2 

EquivalentProperties(i?i . . .R n ) 

Ri = • • • = R n 

DatatypeProperty (U super (Ci) . . . super (U n ) 

U\ZUi 

domain(Ci) . . .dornain(C m ) 

^ 1 C C Ci 

range (Di) . . . range (DO 

T C VC. A: 

[Functional] ) 

TEC C 

SubPropertyOf (JJ\ U 2 ) 

Ci C C 2 

EquivalentProperties((/i . . . U n ) 

Cl = • • • = u n 

AnnotationProperty (S') 


OntologyProperty (S) 


Individual (o type(Ci) . . . typ e(C n ) 

0 G Cj 

value (R\ Oi) . . . value (R n o n ) 

(0, Oi) G Ri 

value((/i ri) .. .value (JJ n r n )) 

(0, Vi) G C. 

Samelndividual (oi . . . o r ) 

{01} = • • • = {o n } 

Diff erent!ndividuals(oi . . . o n ) 

{oi} C 


Fig. 14.4. OWL DL axioms and facts 


Object properties can also be specified to be transitive, symmetric, func- 
tional, and inverse functional. In order to retain decidability, OWL DL 
restricts how these specifications can be combined for a particular object 
property: properties that are specified as being transitive, and their super- 
properties and their inverses, cannot have their cardinality restricted, either 
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via the functional part of property axioms or in cardinality restrictions (see 
[Horrocks et al., 1999]). 

Annotation properties are a way of associating uninterpreted infor- 
mation with classes, properties, and individuals. The syntax for an- 
notations is not given in Figure 14.4 (as they do not contribute to 
the corresponding Description Logic syntax), but some examples of 
their use are given below. Many axioms (Class, EnumeratedClass, 
Datatype, ObjectProperty, DatatypeProperty, AnnotationProperty, 
OntologyProperty, and Individual axioms) can have an annotation that 
provides uninterpreted information about that class, property, or individual. 
To prevent annotations influencing the semantics of OWL DL, little can be 
said about annotation properties: they cannot participate in restrictions, 
nor can they be given domains, ranges, or any other aspect of other kinds of 
properties. These limitations result in only trivial inferences being possible 
in relation to annotation properties in OWL DL. 

One use of annotation properties is to provide comment information for 
classes and properties, as in 

Class (ex: Country partial 

annotation (rdf s : comment "Countries of the world") 
owl : Thing) 

This is, perhaps, the most useful purpose for annotations in OWL DL. 

Information about individuals can also be provided in OWL, for either 
named individuals or anonymous individuals, as in: 

Individual (ex : Canada type (ex : Country) ) 

Individual (ex : England type (ex : Country) ) 

Individual (ex : Peter type (ex : Canadian) 

value (ex : age "48" ~~xsd : integer) ) 

Individual (value (ex : nationality ex : England) 
value (ex : age "44" ~ "xsd : integer) ) 

In OWL, axioms and facts are grouped into ontologies, with the result 
that an OWL DL ontology is equivalent to a Description Logic knowledge 
base, i.e. , a TBox plus an ABox (see Chapter 2). This is not completely 
standard, as ontologies are more typically thought of as describing only 
the structure of a domain (in terms of classes and properties), and not 
as describing a particular situation (in terms of instances of classes and 
properties); in this more common usage, an ontology is therefore equivalent 
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to a Description Logic TBox, and not to the combination of a TBox and an 
ABox. 

Ontologies can be given annotations, just like classes, properties, and indi- 
viduals. Ontologies can also have ontology properties as annotations. Most 
ontology properties act just like annotation properties (i.e. , they provide 
uninterpreted information), but one special property, owl: imports, can be 
used to “import” other ontologies. Importing an ontology effectively treats 
the content of a web page as part of the current ontology, as in: 

Ontology (Simple 

Annotation (rdf s : comment "A simple ontology for nationality") 
Annotation (owl : imports http : //www. f oo . ex/simpler . owl) 


) 


The meaning of the above imports statement is that the set of axioms in 
the Simple ontology is taken to include the set of axioms in the simpler . owl 
ontology. 


14-3.4 Semantics for OWL DL 

A formal semantics, very similar to the semantics provided for Description 
Logics (see Subsection 2.2.1), is provided for this style of using OWL. Full 
details on this model theory can be found in the OWL Semantics and Ab- 
stract Syntax [Patel-Schneider et al., 2004]. 

Because OWL includes datatypes, the semantics for OWL is very similar 
to that of Description Logics that also incorporate datatypes, in particular 
STLOQifD). However, the particular datatypes used in OWL are taken from 
RDF and XML Schema Datatypes [Biron and Malhotra, 2001]. Data values 
such as "44" ~~xsd: integer thus mean what they would mean as XML 
Schema data values. 

The semantics for OWL DL does include some aspects that may be viewed 
as unusual from a Description Logic perspective. Annotations are given a 
simple separate meaning, not shown here, that can be used to associate in- 
formation with classes, properties, and individuals in a manner compatible 
with the RDF semantics. Ontologies also live within the semantics and can 
be given annotation information. Finally, owl: imports is given a meaning 
that involves finding the referenced ontology (if possible) and adding its 
meaning to the meaning of the current ontology. In other respects, however, 
the meaning of an OWL ontology should be exactly equivalent to the mean- 
ing of the Description Logic knowledge base derived via the correspondences 
given in Figures 14.3 and 14.4. 
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What makes OWL DL (and OWL Lite) a Semantic Web language, there- 
fore, is not its semantics, which is quite standard for a Description Logic, 
but instead the use of URI references for names, the use of XML Schema 
datatypes for data values, and the ability to connect to documents in the 
World Wide Web. 


14-3.5 OWL Lite and OWL Full 

As we have seen, OWL DL is related to S7iOTJ\f(D), a very expressive 
Description Logic. This Description Logic is somewhat difficult to present to 
naive users, as it is possible to build complex Boolean descriptions using, for 
example, union and complement. STiOLAf (D) is also difficult to reason with, 
as key inference problems have NExpTinre complexity, and the potential 
for complex descriptions makes it somewhat difficult to build tools such as 
editors. 

For these reasons, a subset of OWL DL has been identified that should 
be easier on all the above metrics; this subset is called OWL Lite. OWL 
Lite prohibits unions and complements, restricts intersections to the implicit 
intersections in the frame- like class axioms, limits all embedded descriptions 
to class names, does not allow individuals to occur in descriptions or class 
axioms, and limits cardinalities to 0 or 1. 

These restrictions make OWL Lite expressively equivalent to STiLJ-fD). 
As in STCLJ-fD), key inferences in OWL Lite can be computed in worst-case 
exponential time (ExpTime), and there are already several optimized reason- 
ers for logics equivalent to OWL Lite (see Subsection 14.3.7). This improve- 
ment in tractability comes with relatively little loss in expressive power - 
although OWL Lite syntax is more restricted than that of OWL DL, it is 
still possible to express complex descriptions by introducing new class names, 
and exploiting the duality between allValuesFrom and someValuesFrom re- 
strictions [Horrocks and Patel-Schneider, 2003] . For example, the OWL Lite 
axioms 

Class (C complete A B) 

ObjectProperty (p) 

Class (C complete restriction(p allValuesFrom(Nothing) ) ) 

Class (D complete restriction(p someValuesFrom (Thing) ) ) 

result in the class D being equivalent to the union of the negations of A and 
B, i.e. , D = -i A U -i B. Using these techniques, all OWL DL descriptions can 
be captured in OWL Lite except those containing either individual names 
or cardinalities greater than 1. 
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OWL DL and OWL Lite are extensions of a restricted use of RDF and 
RDFS, because, unlike RDF and RDFS, they do not allow classes to be 
used as individuals, and the language constructors cannot be applied to the 
language itself. For users who want these capabilities, a version of OWL that 
is upward compatible with RDF and RDFS has been provided; this version 
is called OWL Full. In OWL Full, all RDF and RDFS combinations are 
allowed. For example, in OWL Full, it is possible to impose a cardinality 
constraint on rdf s : subClassOf , if so desired. 

OWL Full contains OWL DL, but goes well outside the standard De- 
scription Logic framework. The penalty to be paid here is two-fold. First, 
reasoning in OWL Full is undecidable. Showing the undecidability is trivial, 
because restrictions required in order to maintain the decidability of OWL 
DL do not apply to OWL Full [Horrocks et al, 1999], but also as a result of 
the ability to apply OWL’s expressive power to RDF syntax, as exemplified 
above [Motik, 2005] . Second, the abstract syntax for OWL DL is inadequate 
for OWL Full (the abstract syntax would not, for example, allow for the 
representation of cyclical RDF graphs or for graphs that contain only parts 
of OWL DL syntactic constructs), and the official OWL exchange syntax, 
RDF/XML, must be used. 

OWL Full has been given a model-theoretic semantics that is a vocabulary 
extension of the RDF model theory [Patel-Schneider et al., 2004; Hayes, 
2004] . A correspondence between this semantics and the semantics of OWL 
DL has also been established: it has been shown that the model theory for 
OWL DL has very similar consequences to this RDF-style model theory for 
those OWL ontologies that can be written in the OWL DL abstract syntax 
[Patel-Schneider et al., 2004], 

More formally, given two OWL DL ontologies 0\ and O 2 , written in the 
abstract syntax, if (D\ entails O 2 according to the OWL DL model theory 
then the mapping of 0\ into RDF triples will entail the mapping of O 2 into 
RDF triples according to the OWL Full model theory. 

The converse, however, is not true. Although the correspondence is usually 
exact, it has been shown that there are at least some pathological cases where 
the correspondence breaks down. For example, in OWL Full owl : Thing 
contains individuals corresponding to the RDFS and OWL vocabularies 
(such as rdf : Property, rdf s : subClassOf , and owl :hasValue). (There are 
no such individuals in the model theory for OWL DL.) Limiting the extent 
of owl : Thing, for example as in 

SubClassOf (owl : Thing oneOf (ex : f 00 ) ) 
forces equalities in this vocabulary, which have truly unusual effects in 
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the OWL Full model theory. For example, because there would be only 
one individual, and OWL syntax tokens are individuals in OWL Full, an 
allValuesFrom restriction would imply a someValuesFrom restriction. 

In order to avoid any possible confusion as to the meaning of OWL DL, 
the OWL Full model theory has been given “non-normative” status (i.e., it 
is only informative) for OWL ontologies that can be written in the abstract 
syntax - for such ontologies, the definitive semantics is given by the OWL 
DL model theory. 


14-3.6 OWL datatypes 

As well as dealing with “abstract” classes such as Person and Animal, many 
practical applications need to represent and reason about datatypes and 
values such as integers and strings. The integration of datatypes in the 
OWL language is again heavily influenced by Description Logic research, 
which has demonstrated that care is required in order to avoid complexity 
blow-ups or even undecidability being caused by datatypes [Lutz, 2002]. In 
the SftOQ{V) Description Logic it was shown that this could be achieved 
by strictly separating the interpretation of datatypes and values from that 
of classes and individuals: SHOQ(V ) interpretations include an additional 
interpretation domain for data values which is disjoint from the domain 
of individuals A 1 . Datatypes, such as integer, are interpreted as a subset 
of Aq, and values such as the integer “35” are interpreted as elements 
of A^. The separation is further strengthened by dividing properties into 
two disjoint sets of abstract and datatype properties. Abstract properties 
such as brother are interpreted as binary relations on A x (i.e., subsets of 
A 1 x A 1 ), while datatype properties such as age are interpreted as binary 
relations between A 1 and Ap (i.e., subsets of A 1 x A^). 

This design has the advantage that reasoning with datatypes and values 
can be almost entirely separated from reasoning with classes and individu- 
als - a class-based reasoner simply needs access to a datatype “oracle” that 
can answer simple questions with respect to datatypes and values (e.g., “is 
—5 a nonNegative Integer?”). Moreover, the language remains decidable if 
datatype and value reasoning is decidable, i.e., if the oracle can guarantee 
to answer all questions of the relevant kind for supported datatypes. This 
can easily be achieved for a range of common datatypes such as integers, 
decimals, and strings [Lutz, 2002]. 

As well as these practical considerations, it can also be argued that the 
separation of classes and datatypes makes sense from a philosophical stand- 
point as datatypes are already structured by built-in predicates such as 
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greater-than and less-than. From this point of view, it does not make sense 
to use ontology axioms to add further structure to datatypes or to form 
“hybrid” classes such as the class of red integers. 


14-3.7 Reasoners, tools, and applications 

As discussed in Subsection 14.3.1, an important motivation for the design of 
OWL DL and OWL Lite was the ability for applications to exploit known 
reasoning algorithms and existing (highly optimized) reasoner implemen- 
tations. This meant that, even before the OWL specification was finalized, 
prototype tools and applications could make use of Description Logic reason- 
ing systems such as FaCT [Horrocks, 1998a], Pellet [Pellet, 2003] and Racer 
[Haarslev and Moller, 2001e]. The use of these systems was also facilitated 
by the fact that they provide a standard application interface designed by 
the Description Logic Implementation Group (DIG) [Bechhofer et al, 1999]. 

At the time when OWL achieved recommendation status (the final stage 
in the W3C standardization process), the above systems were only able to 
support OWL Lite reasoning. This was because, at that time, no suitable 
algorithm was known for STiOlM, and all of the implementations were 
based on the STilQ algorithms described in [Horrocks et al, 1999] and 
[Horrocks et al, 2000c]. In order to process OWL DL ontologies, reasoning 
systems typically applied some “work-around” with respect to nominals, 
e.g., by treating them as primitive classes. This work-around is sound but 
incomplete for subsumption. I.e., given ontologies O and O' , where O' has 
been derived from O by replacing each occurrence of a nominal o with a 
primitive class C Q , a class C is subsumed by a class D with respect to O if C 
is subsumed by a D with respect to O', but if C is not subsumed by a D with 
respect to O', then we cannot be sure that C is not subsumed by a D with 
respect to O. Clearly, the work-around is unsound for satisfiability, i.e., there 
may be concepts that are satisfiable with respect to O', but unsatisfiable with 
respect to O. 

This situation was clearly very unsatisfactory given the motivation for 
OWL’s Description Logic-based design, and it was always anticipated that 
existing tableaux decision procedures for STilQ and STiOQ would soon be 
extended to STiOlQ. Although this took a little longer than anticipated, a 
tableaux decision procedure for STiOlQ was eventually developed [Horrocks 
and Sattler, 2005], and both the Pellet system and the FaCT+- 1- system 
[Tsarkov and Horrocks, 2005] (the successor of the FaCT system) now use 
this algorithm to fully support OWL DL. It is expected that other reasoners, 
including Racer, will soon follow suit. 
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It is interesting to note that OWL is now supported by commercial De- 
scription Logic systems. These include the Cerebra system from Cerebra 
Inc. (formerly Network Inference), and RacerPro, a commercial version of 
the Racer system. There are also several OWL reasoners that are not based 
on tableaux decision procedures. These include KAON2, a system that uses 
a reduction of S7ilQ( V) to disjunctive datalog [Hustadt et al, 2004], and 
Hoolet, a system that uses the Vampire First Order Theorem prover via a 
translation of STiOTM (D) into FOL [Tsarkov et al., 2004]. 

The growing importance of ontologies, and the emergence of the OWL 
standard, has also given impetus to the development of ontology engineering 
tools, including tools for editing, validating, visualizing, merging, and debug- 
ging OWL ontologies. Several Application Programming Interfaces (APIs) 
for OWL are also available. 

Of the available OWL Editing tools, probably the best-known and most 
widely used is Protege [Gennari et al., 2003]. Protege is a frame-based 
editor that supports OWL via an OWL Plugin. The Plugin uses a range of 
techniques (some of which were first developed in the OilEd editor [Bech- 
hofer et al., 2001b]) to extend the language that can be dealt with, e.g., 
by explicitly specifying quantification with slots and allowing for multiple 
necessary and sufficient conditions in class definitions. In addition to the 
frame editor, there are additional Protege plugins supporting, e.g., ontol- 
ogy visualization, ontology documentation, and “wizards” that can auto- 
mate some basic steps in the ontology development process. Protege can 
connect to any Description Logic reasoner with a DIG compliant interface, 
and uses the reasoner to check class consistency, to compute the class hier- 
archy, and to compute the most specific class(es) that each individual is an 
instance of. 

Several other OWL editing tools are also available. These include OilEd 
[Bechhofer et al., 2001b] (from Manchester University), SWOOP [Kalyan- 
pur et al., 2005a] (from the Pellet team) and Construct (from Cerebra). The 
design of OilEd is based on that of Protege, but it provides more complete 
support for OWL DL. Like Protege, OilEd can use any DIG compliant 
reasoner to reason over the ontology. SWOOP is browser-based, and is much 
more tightly linked to OWL’s syntactic structure; it provides both abstract 
syntax and RDF /XML syntax editing modes, and fully supports OWL DL. 
SWOOP uses the Pellet system for reasoning support, and also has an inte- 
grated debugger [Kalyanpur et al., 2005b]. Construct is a graphical tool that 
uses a UML-like notation; it uses the Cerebra reasoner to provide reasoning 
support. 
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OWL editing tools are also expanding into the software engineering realm, 
as tools such as Sandpiper Software’s Medius Visual Ontology Modeler sup- 
ports ontology development using UML modeling tools with output in OWL 
(see http://www.sandsoft.com/). This enables broader communities to 
model, edit, and integrate with OWL ontologies. 

Although OWL was initially developed as an ontology language for the 
Semantic Web, OWL has also been widely adopted as a de facto standard 
for ontology-based applications. These include “traditional” applications in 
e-Science and -Medicine, as well as applications in industry and government. 
The advantages of using OWL in such applications include the relative sta- 
bility and interoperability conferred by a W3C standard, and the availability 
of an expanding range of reasoners and tools. 

Examples of OWL ontology applications include: 

• Ontologies developed by members of the Open Biomedical Ontologies Consortium 
(sec http://obo.sourceforge.net/), which recommends OWL as the exchange 
language for all Life Science ontologies. These include the widely used Gene On- 
tology (GO) and Microarray Gene Expression Data (MGED) ontology. 

• The US National Cancer Institute (NCI) “thesaurus” , an ontology containing the 
working vocabulary used in NCI data system (see http://ncicb.nci.nih.gov/ 
NCICB/ core/EVS/). 

• United Nations Food and Agriculture Organization (FAO) is using OWL to de- 
velop a range of ontologies covering areas such as agriculture and fisheries (see 
http : / /www . f ao . org/ agris/ aos/ Applications/ intro . htm) . 

• The Semantic Web for Earth and Environmental Terminology (SWEET) on- 
tologies developed at the US National Aeronautics and Space Administra- 
tion (NASA) Jet Propulsion Laboratory (see http://sweet.jpl.nasa.gov/ 
ontology/). These include ontologies describing space, the biosphere, and the 
sun. SWEET is now being expanded by a number of earth and space science ef- 
forts, and has been augmented in the GEON project (see http : //www . geongrid . 
org/) to cover the solid earth, and by the Virtual Solar Terrestrial Observatory 
Project (see http : //vsto . hao . ucar . edu/) to include much more information on 
the atmosphere. 

• An ontology used at General Motors in a project to help quality improvement 
activities for assembly line processes in different production sites [Morgan et al., 
2005], 


14.4 Summary 

The Semantic Web is envisaged as an evolution of the existing Web where 
terms defined in ontologies will be used to give well defined and machine- 
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processable meaning to Web accessible information and services. OWL, a 
Description Logic-based ontology language, has been designed for this pur- 
pose. 

Because of the ambitious design goals for OWL, because of the multiple 
influences on OWL, and also because of the structural requirements con- 
straining OWL, the development of OWL has not been without problems. 
Through hard work and compromise, these problems have largely been over- 
come, resulting in an ontology language that is truly part of the Semantic 
Web. 

It was not possible to simultaneously satisfy all of the constraints on OWL, 
so two styles of using OWL have been developed, each appropriate under 
different circumstances. 

One style of OWL usage is motivated by the need to unambiguously rep- 
resent information in an expressive language, but one that can still be rea- 
soned with predictably. When this is a primary goal, OWL DL will be the 
target language, and it has been used in a number of existing applications. 
When using OWL DL, some compatibility with RDF is lost, mostly having 
to do with using classes and properties as individuals. On the other hand, 
users of OWL DL benefit from decidable inference, and the availability of 
an increasingly wide range of tools and infrastructure, including efficient 
reasoning systems and sophisticated ontology development environments. 
OWL DL also has a frame-like alternative syntax that can be used to make 
working with OWL easier. 

Even though OWL DL is, essentially, a Description Logic, it also in- 
cludes features that place it firmly in the Semantic Web. OWL DL uses the 
datatyping mechanisms from RDF and many of the built-in XML Schema 
datatypes. OWL DL uses RDF URI references as names, including the 
names from RDF, RDFS, and XML Schema datatypes that are relevant. 
Entailment in OWL DL is compatible with entailment in RDF and RDFS. 

For users who still need unambiguous representation and predictable rea- 
soning, but for whom simplicity is more important than expressive power, 
the OWL Lite subset of OWL DL may be a good choice. This sublanguage 
rules out some of the things that can be said in OWL DL, but still retains 
considerable expressive power. Moreover, OWL Lite is supported by a wider 
range of reasoning tools, and as key reasoning tasks are of lower worst-case 
complexity, these reasoners might be expected to be more efficient, in gen- 
eral, than OWL DL reasoners. 

The other style of OWL usage is one where compatibility with RDF is 
the overarching concern. In this case, OWL Full would be an appropriate 
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choice. OWL Full extends RDF and RDFS to a full ontology language, with 
a well-specified entailment relationship that extends entailment in RDF and 
RDFS, while avoiding any paradoxes that might arise. However, entailment 
in OWL Full is undecidable, which is a significant issue in most circum- 
stances, and no effective tools for reasoning are available for OWL Full, nor 
are they expected to appear. Also, the user-friendly alternative syntax is not 
adequate for OWL Full, so RDF /XML must be used. 

In practice, relatively few OWL Full applications have emerged to date, 
and where OWL Full ontologies are found, they often turn out to be outside 
the OWL DL subset only as the result of minor syntactic errors. Fragments 
of OWL are, however, sometimes used as ad hoc extensions to RDFS. A 
common example is the use of OWL functional properties, and explicit equiv- 
alences and (in) equalities, in what would otherwise be an RDFS ontology. 

There remain, of course, significant issues that are deliberately not han- 
dled by OWL, but which are definitely relevant to many Semantic Web use 
cases: 

• OWL avoids anything related to non-monotonicity (such as default reasoning and 
localized closed- world assumptions); 

• OWL’s limited expressiveness excludes operations such as property-chaining, or, 
more generally, axioms with variables, such as rules (although there are already 
proposals for extensions in this direction [Horrocks et al., 2005; Eiter et al, 2004; 
Motik et al., 2004; Rosati, 2005], some of which are based on earlier work on 
integrating Description Logics and logic programming rules as described in Chap- 
ter 2); 

• like the Description Logics on which it is based, OWL does not support N-ary 
relations (although Description Logics supporting N-ary relations are known [Cal- 
vanese et al, 1999c]); 

• OWL does not allow for the use of data values as database style keys [Lutz 
et al., 2004], or for functionality or path constraints [Calvanese et al, 1999c; 
Khizder et al., 2001]; 

• OWL’s import mechanism is limited, and does not support fine-grained operations 
(such as the importation of parts of ontologies); 

• OWL integrates datatypes in a very clean way, but there is no notion of operations 
on these datatypes (such as integer arithmetic or string operations). 

Extending the current Semantic Web with some or all of these features 
not only will require a standardization effort, but sets a significant research 
challenge to the community. 
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Natural Language Processing 
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Abstract 

In most natural language processing applications, Description Logics have 
been used to encode in a knowledge base some syntactic, semantic, and prag- 
matic elements needed to drive the semantic interpretation and the natural 
language generation processes. More recently, Description Logics have been 
used to fully characterize the semantic issues involved in the interpretation 
phase. In this chapter the various proposals that have appeared in the lit- 
erature about the use of Description Logics for natural language processing 
will be analyzed. 


15.1 Introduction 

Since the early days of the Kl-One system, one of the main applications of 
Description Logics has been for semantic interpretation in natural language 
processing [Brachnran et al. , 1979]. Semantic interpretation is the derivation 
process from the syntactic analysis of an utterance to its logical form - in- 
tended here as the representation of its literal deep and context-dependent 
meaning. Typically, Description Logics have been used to encode in a knowl- 
edge base both syntactic and semantic elements needed to drive the seman- 
tic interpretation process. One part of the knowledge base constitutes the 
lexical semantics knowledge, relating words and their syntactic properties 
to concept structures, while the other part describes the contextual and do- 
main knowledge, giving a deep meaning to concepts. By developing this idea 
further, a considerable part of the research effort has been devoted to the de- 
velopment of linguistically motivated ontologies, i.e., large knowledge bases 
where both concepts closely related to lexemes and domain concepts coexist. 
Logical forms and various kinds of internal semantics representations based 
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on Description Logics may also provide the basis for further computational 
processing such as representing common meanings in machine translation 
applications, generating coherent text starting from its semantic content, 
answering database queries, and dialog management. 

After a big success in the 1980s and the beginning of the 1990s (see, 
e.g., the collection of papers in [Sowa, 1991]), the interest of the applied 
computational linguistic community in Description Logics began to drop, 
as did its interest in well-founded theories on syntax or semantics. At the 
time of writing this chapter, there is no major applied project in natural 
language processing making use of Description Logics. This is due to the 
positive achievements in real applications of the systems based on shallow 
analysis and statistical approaches to semantics, initiated by the applications 
in the message understanding area. 

In this chapter the basic uses of Description Logics for natural language 
processing will be analyzed, together with a little bit of history, and the role 
of Description Logics in the current state of the art in computational lin- 
guistics will be pointed out. Obviously, space constraints will lead to several 
omissions and over-simplifications. 


15.2 Semantic interpretation 

In order to understand the role of Description Logics in semantic interpre- 
tation, let us first introduce a general setting for the process of deriving a 
logical form of an utterance. 

A basic property of a logical form as a semantic representation of a nat- 
ural language constituent - such as a noun phrase (NP) or a verb phrase 
(VP) - is compositionality , i.e. , the semantic representation of a constituent 
is a function of the semantic interpretation of its subconstituents. Thus, 
a close correspondence between syntactic structure and logical form is al- 
lowed. In this way, a parser working according to some grammar rules can 
incrementally build up the semantic interpretation of an utterance using 
the corresponding lexical semantic rules of logical composition - specifying 
how the logical terms associated to the subconstituents are to be combined 
in order to give the formula for the constituent. Thus, each lexeme has an 
associated (possibly complex) logical term, which forms its contribution to 
the meaning of the utterance it is part of. 

In the context of such a formalism, an effective semantic lexical discrim- 
ination process could be carried out during parsing, by cutting out the ex- 
ponential factor due to the explicit treatment all the possible derivations. 
Semantically implausible interpretations can be discarded, by checking - 
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whenever the parser tries to build a constituent - the inconsistency of the 
logical form conrpositionally obtained at that stage. This leaves out many 
syntactically plausible but semantically implausible interpretations. Such 
a discrimination step is highly effective in restricted domain applications, 
where knowledge of the world considerably reduces the number of possible 
models. Clearly, the more the contextual and domain knowledge is taken 
into consideration when evaluating a logical form, the more effective is the 
discrimination process. Thus, consistency checking of logical forms plays the 
role of a generalized selectional restrictions mechanism. 

But what is the relationship between a syntactic constituent and its range 
of possible lexical semantic contributions? The conceptual content of a lex- 
eme should convey both the lexical relations - such as, for example, syn- 
onymy, hyponymy, incompatibility - and the subcategorization information 
about the expected arguments (aka complements) of the lexical entry. For 
example the verb paint may be conceptualized as an event having an agent 
thematic role corresponding to the subject syntactic argument, with a spec- 
ified selectional restriction being the concept animate. It is important to 
distinguish the syntactic information - such as the lexical relations and the 
subcategorization frame constraining the complements to have specific syn- 
tactic structures - from the semantic information - such as the thematic 
roles and their selectional restrictions. A semantic lexical entry will specify 
the appropriate mappings between the syntactic structure of the lexeme and 
the conceptual information. 

The situation is, of course, a bit more complex, since, for example, there 
is no direct obvious conceptual content to lexemes belonging to partic- 
ular syntactic categories like adjectives or adverbs. Moreover there is a 
distinction between complements (which are considered as internal argu- 
ments) and adjuncts (which are considered as modifiers). It is outside 
the scope of this chapter to analyze the correspondence between syntax 
and semantics and its compositional nature (see, e.g., [Jackendoff, 1990; 
Pustejovsky, 1988]). 

For example, the sentence “A painter paints a fresco” involves the con- 
cepts Painter, Fresco, and Paint, where the concept Paint has two thematic 
roles associated to it, an agent and a goal, with the concepts Animate and 
Inanimate as respective selectional restrictions. Moreover, the conceptual- 
izations should include the facts that a Painter is a subconcept of Ani- 
mate, a Fresco is a subconcept of Inanimate, and the concepts Animate and 
Inanimate are disjoint. This information is enough, for example, to vali- 
date the above sentence, while it would discard as semantically implausi- 
ble the sentence “A fresco paints a painter” . This conceptualization and its 



490 


E. Franconi 


relationship with the lexical knowledge can be encoded in a DL knowledge 
base. 

Many studies have been done about building a good DL knowledge 
base for natural language processing (also called ontology) [Bateman, 1990; 
Hovy and Knight, 1993; Knight and Luk, 1994; Bateman et al., 1995]- see 
also Chapter 14. A good linguistically motivated ontology ought to be par- 
titioned into a language- dependent but domain-independent part (the upper 
model) and a language-independent but domain-dependent part (the domain 
model) - but this result is theoretically very hard to achieve [Bateman, 1990; 
Lang, 1991]. A good linguistically motivated ontology should be used both 
for semantic interpretation and for natural language generation (see Sec- 
tion 15.4). The conceptualization in the ontology should be at a level of 
granularity which may depend on the application: if selectional restrictions 
are too specific, disambiguation is achieved, but probably many correct sen- 
tences will be rejected (e.g., the sentences involving some form of metaphor, 
type shifting, or metonymy); if selectional restrictions are too general, the 
opposite problem may appear. In principle, a good linguistically motivated 
ontology should be abstract, large-scale, re-usable. However, these goals are 
very hard to achieve since they conflict with the practical need to implement 
effective and discriminating ontologies in specialized domains. 

The ideas just sketched form the theoretical background of any application 
of Description Logics for semantic interpretation, since the early works where 
Kl-One was involved [Bobrow and Webber, 1980; Sondheimer et al., 1984; 
Brachman and Schmolze, 1985; Jacobs, 1991]. Every realized system relies 
on the so-called multilevel semantics architecture [Lavelli et al., 1992], where 
a sequence of processing phases is distinguished: 

• Lexical discrimination: whenever the parser tries to build a constituent, the con- 
sistency of the semantic part of such a constituent is checked. In parallel, a first 
logical form is built up - where references and quantifier scoping are still am- 
biguous - expressing the meaning of the sentence in the most specialized way 
with respect to the semantic lexicon and the background knowledge. Heuristics 
are applied to the minimal form in order to obtain a preferential ordering of the 
semantically consistent but still lexically ambiguous interpretations. 

• Anaphora and quantifier scoping resolution: the semantically plausible referents 
for linguistic expressions such as definite NPs, pronouns and deictic references are 
identified, and the scope of quantifiers is resolved by making explicit the different 
unambiguous interpretations. Syntax-based heuristics are used to cut down the 
various derivations to a unique unambiguous one. 

• Contextual interpretation: decides how to react in a given dialogic situation, con- 
sidering the type of request, the context, and the model of the user interest. 
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It makes use of knowledge about the speech acts, the dialog and the user 
model. 

It has to be emphasized that all the approaches aim at deriving a unique 
unambiguous logical form. For this purpose, the logical form is treated as 
a mere compositionally obtained data structure on which to operate ad 
hoc algorithms for solving ambiguities, with the support of the information 
represented in the knowledge base. There is no attempt to give a logic-based 
semantics to the “logical form” during the disambiguation phases. The role of 
Description Logics is thus limited to serving a lexically motivated knowledge 
base, which is used for building the logical form. Some approaches purport 
to represent the logical form itself as DL assertions, but in fact they use it 
just as a support for somehow computing the real logical form. Section 15.3 
will discuss the few DL-based well-founded approaches, where the whole 
semantic interpretation process has been given a logical foundation. 

A number of recent important projects involving Description Logics for 
semantic interpretation are listed below. 

• The JANUS system [Weischedel, 1989], where the consistency check of the selec- 
tional restrictions was implemented as a double up-and-down subsumption check. 

• The Xtra system [Allgayer et al., 1989], proposing a clear distinction between the 
domain independent linguistically motivated part of the knowledge base (called 
Functional-Semantic Structure, FSS), and the domain-dependent part (called 
Conceptual Knowledge Base, CKB) modeling the knowledge of an underlying 
expert system. 

• The Pracma project [Fehrer et al., 1994], in which an expressive Description 
Logic has been studied to support special inferences such as probabilistic reason- 
ing, non-monotonic reasoning, and abductive reasoning. 

• The Lilog project [Herzog and Rollinger, 1991], funded by IBM, a very ambitious 
research project for studying the logical foundations of the semantics of natural 
language, with an emphasis on computational aspects. The project belongs to the 
category of projects where the whole semantic interpretation process has been 
given a logical foundation - by means of a sorted first-order logic. However, the 
role of Description Logics is again just as a knowledge server during the various 
interpretation and disambiguation phases. 

• The AlFresco system, a multi-modal dialog prototype for the exploration of 
fourteenth century Italian painters and frescoes [Stock et al, 1991; 1993], and the 
natural language interface for the concierge of the system Maia, a mobile robot 
with intelligent capabilities in the domain of office activities [Samek-Lodovici 
and Strapparava, 1990; Lavelli et al., 1992; Franconi, 1994]. These systems are 
characterized by the presence of natural language dialogs, so that logical form 
becomes central to conveying the meaning for the evolving behavior of the system. 
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• The Verbmobil project [Wahlster, 2000], a large speech-to-speech translation 
project, with translations into German, English, and Japanese. In Verbmobil, 
the role of Description Logics is limited to the off-line pre-computation of a 
taxonomy of concepts with thematic roles and selectional restrictions, which are 
then used by ad hoc rules during the runtime disambiguation phase. 

• The Ford Direct Labor Management System (DLMS) [Rychtyckyj, 1996; 
1999] is one of the few industrial level examples of a DL-based application 
involving natural language. DLMS utilizes a DL knowledge base in a fairly 
standard way to build the semantic interpretation of process sheets - natural 
language documents containing specific information about work instructions - 
and to generate from them structured descriptions of the parts and the tools 
required for allocating labor at the car plant floor. 


15.3 Reasoning with the logical form 

Traditionally, the logical form has been considered in computational linguis- 
tics as only representing the literal — i.e., context- independent - meaning of 
an utterance, as clearly distinguished from the representation of the surface 
syntactic constituent structure, and from a deeper semantic representation, 
which is a function of discourse context and world knowledge. Thus, the log- 
ical form plays in these cases an intermediate role between syntax and the 
deep semantics, and it is therefore not intended to fully contain the mean- 
ing in context of the utterance. Moreover, quite often a further distinction 
is introduced between quasi logical forms - i.e., literal under-specified se- 
mantic representations - and proper logical forms - i.e., literal unambiguous 
derivations. 

The reasons for separating the literal under-specified, the literal unam- 
biguous, and the deep meaning representations are mainly pragmatic rather 
than theoretical. Pure linguists would say that any sentence has just one 
unambiguous meaning, and that any ambiguity is introduced by under- 
constraining the interpretation process - e.g., by not adequately considering 
the context knowledge. In such a case, they would speak of different possi- 
ble ending paths in the derivation (i.e., interpretation) process, each one of 
them being again unambiguous. Clearly, this approach is infeasible from a 
computational point of view: first, because the number of derivations might 
combinatorially increase; and secondly because the interdependencies among 
the derivations are lost. 

On the other hand, computational linguists consider ambiguities as part 
of the meaning of utterances, with the ultimate goal of being able to reason 
with such under-specified expressions, in order to increase compactness in 
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the representation and efficiency in the processing. Allen [1993] argues that 
one of the crucial issues facing future natural language systems is the devel- 
opment of knowledge representation formalisms that can effectively handle 
ambiguity. 

We can identify two main approaches. The classical computational 
approaches - like the ones described above - rely on the modularity of the 
semantic analysis process - the multilevel semantics architecture - starting 
from the under-specified representation and ending up with an unambiguous 
and context-dependent representation. The semantics- oriented approaches 
usually propose a very expressive logical language - possibly with an expres- 
sivity greater than first-order logic - with the goal of giving a clear semantics 
to many natural language phenomena, and in particular to ambiguities and 
under-specification. Ambiguities can be roughly classified as follows: lexical 
ambiguities introduced by, e.g., prepositions, nouns, and verbs; structural 
ambiguities such as PP-attachment ambiguities - involving prepositional 
phrases; referential ambiguities such as quantification scoping and anaphora. 

A disadvantage of the first approach is that there are no solid formal 
grounds for the proper use of the logical form, and in particular for the 
treatment of ambiguity, so that operations on the logical form are often 
based on heuristics and ad hoc procedures. This can be justified by the 
fact that reasoning on logical forms including - among other things - do- 
main knowledge, incomplete and ambiguous terms, unsolved references, and 
under-specified quantifications, is considered a hard computational task. 
Computational linguists have devised structural processing techniques based 
on syntax, selectional restrictions, case grammars, and structured informa- 
tion such as frames and type hierarchies - carefully trying to avoid or to 
drastically reduce the inclusion in the computational machinery of logical 
inference mechanisms for treating ambiguities. Of course, these techniques 
often need ad hoc mechanisms when such ambiguities come into play. The 
computational approach is an example of “ knowledge representation as en- 
gineering ” . 

On the other hand, a number of recent works in applying Descrip- 
tion Logics to natural language processing ([Quantz, 1995; Franconi, 1996; 
Ludwig et ai, 2000]) are getting closer to a semantics-oriented approach, but 
they follow a minimalist conceptualization, and they emphasize the compu- 
tational aspects. Instead of trying to solve sophisticated semantic problems 
of natural language, they try to logically reconstruct some basic issues in a 
general way, which is compositional, homogeneous, principled, and interest- 
ing from an applicative point of view. The main idea of these approaches is 
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to take logical forms seriously: they represent not only the literal meaning of 
the fragment but also lexical ambiguities, represent unresolved referents via 
variables and equality, interpret plural entities and (generalized) quantifiers, 
and are linked to a rich theory of the domain. To that end, an expressive 
logical language should have a proper reasoning mechanism, and nonetheless 
be compositional. 

In this section an abstract overview will be given by means of examples, 
in such a way that, we believe, common ideas will be captured. 

Let us first try to understand how a logical form can be characterized in 
terms of proper logical constructs. It is observed that, assuming the widely 
accepted Davidsonian view on eventualities, natural language phrases - such 
as an NP or a VP - explicitly introduce discourse referents stating the exis- 
tence of individuals or events of the domain model. Introduced referents are 
represented as existentially quantified variables. The possibility of having 
variables and constants allows for the representation of referential ambigui- 
ties. This is the basis of most works on logical formalizations of the logical 
form. 

For example, the NP A fresco of Giotto might be given the logical form 
3b. Fresco(6) A of(6, GIOTTO) 

while the NP A fresco painted by Giotto might be given the logical form 
3b, e. Fresco(6) A Paint(e) A agent(e, GIOTTO) A goal(e, b ). (15.1) 

As we have pointed out above, consistency checking of a (partial) logical 
form corresponding to a constituent may help in the semantic discrimination 
process. Thus, in a restricted application domain, we would like to discard 
a sentence like A fresco paints Giotto, since its logical form 

3b, e. Fresco(6) A Paint(e) A agent(e, b) A goal(e, GIOTTO) 

would be inconsistent with respect to a general domain theory of frescoes 
and animate things that we could attach to the lexicon: 

Vx,y. Paint(x) — > (agent(x, y) — > Animate(y)) 

\/x. Animate(x) — > ->lnanimate(x) 

Mx. Fresco(x) — > Inanimate(x). 

Such an axiomatic theory plays the role of meaning postulates for the pred- 
icates appearing in the logical form; they can be also considered as a set of 
predicate level axioms. Using a DL-based formalism, this will be written as 
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the following theory: 

Paint C Vagent. Animate 
Animate C -ilnanimate 
Fresco C Inanimate. 

This is the place where Description Logics play a formal role as general 
domain theories representing the basic ontological properties of common- 
sense domain knowledge. 

Let us consider the deep meaning of A fresco of Giotto. The NP is ambigu- 
ous (at least) with respect to the two readings A fresco painted by Giotto 
and A fresco owned by Giotto. We could reformulate the ambiguous logi- 
cal form, by enumerating the unambiguous derivations, i.e. , by disjoining 
the logical forms of the two readings. However, it is infeasible to explic- 
itly enumerate all the (exponentially large) number of readings; moreover, 
this would not add any information to the logical form. Note however that 
traditional computational approaches purport to always find a unique un- 
ambiguous representation for the final logical form, based on syntactically 
and contextually motivated heuristics; in this case, the enumeration will be 
the basis for an ad hoc preferential ordering. If the logical form is written 
instead as 


3b. Fresco(fr) A (paintedBy LI ownedBy)(6, GIOTTO) (15-2) 

then each of the two readings clearly entails this ambiguous (or, better, 
under-specified) representation. Of course, the use of an explicit disjunction 
to encode the ambiguity requires a particular treatment of the natural lan- 
guage negation, which cannot be represented as a classical negation in the 
logical form. In fact, derivations from the ambiguous content are indepen- 
dent traces and, for example, De Morgan’s law would not hold anymore. 
The treatment of natural language negation has never been considered in 
DL-based approaches. So, we assume the logical form to be always positive; 
of course, this is not necessary for the DL-based domain theory. 

In this way, the lexicon - which can be considered as an associated theory - 
may contain a meaning postulate for the relation of: 

Vx, y. of(x, y ) paintedBy(x, y) V ownedBy(x, y) 

which can be rewritten using Description Logics as 

of = paintedBy U owned By. 

Moreover, if one writes reification axioms (see [Franconi and Rabito, 1994]) 
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of the kind 

Vx,y. paintedBy(y, x) <-> 3z. Paint(z) A agent(z, x) A goal(z, y) 

then the logical form (15.1) with the explicit event also entails the ambiguous 
representation (15.2). In Description Logics, this would be written as 

painted By = goal - |p a int 0 agent. 

The ambiguity of A fresco of Giotto can be monotonically refined later on in 
the dialog by uttering, e.g., either Giotto painted the fresco in Siena or Giotto 
sold his fresco. The refinement process is monotonic, since it is not necessary 
to revise the knowledge asserted by means of the logical form (15.2). 

Lexical ambiguities of nouns can also be represented, as in the example 
The pilot was out - where pilot can be a small flame used to start a furnace, 
or a person who flies airplanes. The sentence He was on the toilet mono- 
tonically refines the previous one, because the pronoun he may refer only 
to a person, thus excluding the reading with flame. Of course, in order to 
make possible such a reasoning by cases, axioms at the predicate level hav- 
ing negation and, more generally, partitioning capabilities have to be added 
to the theory - specifying and reducing the possible models: 

Pilot = Flame U Aviator 
Flame C Process 
Aviator C Human 
Human C Animate 
Animate n Process C _L. 

Verb ambiguity is also captured in the same manner. For example, it is 
possible to rule out the sentence The door opens the door , given the two 
senses of open as “cause to open” - transitive, with an animate agent - 
and “become open” - intransitive. According to these two senses, both the 
constituents “ The door opens ” and “ opens the door ” are consistent, but the 
whole sentence is inconsistent. 

Talking briefly about structural ambiguities, a general theory of common- 
sense knowledge will allow only for one interpretation of Giotto paints the 
fresco with a brush where the PP attaches to the painting event - “ paints 
with a brush ” - ruling out the interpretation u the fresco representing a 
brush”. An early detection of the semantic inconsistency solving the PP- 
attachment problem is very important in practical applications, since the 
non-deterministic choice among the different interpretations is usually left 
to the parser. Thus, the parser does not need to compute a combinato- 
rial number of derivations. Clearly, any metaphoric aspect of language is 
excluded in these approaches. 
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Following a semantics-oriented approach as sketched in this section, 
Quantz [1993; 1995] proposes a preferential DL-based approach to disam- 
biguation in natural language processing. He gives a particular emphasis to 
the problem of anaphora resolution, showing that an adequate disambigua- 
tion strategy has to be based on factors which take globally into account 
heterogeneous information (e.g., from syntax, semantics, domain knowledge) 
and yield preferences with varying degrees of relevance. For this purpose, 
Quantz introduced and developed a sound and complete proof theory for 
a preferential Description Logic , including a non-monotonic extension with 
weighted defaults. In his approach, a DL theory comprises syntactic, seman- 
tic, domain, and pragmatic knowledge, which globally contributes to the 
preferential disambiguation process, following the proposal by [Hobbs et al, 

1993] . 

Franconi [1996] proposes a formalism based on expressive Description 
Logics complemented with the ability to express logical forms as conjunc- 
tive queries [Calvanese et al., 1998a], i.e. , formulae in the conjunctive exis- 
tential fragment of first-order logic. The formalism allows for both under- 
specified semantic representations and encapsulation of contextual and do- 
main knowledge in the form of meaning postulates. In particular, lexical 
ambiguities, structural ambiguities, and quantification scoping ambiguities 
[Franconi, 1993] are considered, and an account of the structure of events 
and processes in terms of tense and aspect is given [Franconi et al., 1993; 

1994] , It is shown how to apply this logic for lexical discrimination based on 
semantic knowledge. 

Ludwig et al. [2000] present a modified version of Discourse Represen- 
tation Theory (DRT) and show that its Discourse Representation Struc- 
tures (DRSs) may be expressed as assertional statements in a Description 
Logic. This allows lexical discrimination during the parsing process based 
on the domain model. In order to capture situations where the available 
information is insufficient to characterize the meaning of an utterance, a 
partial logic (called first-order ionic logic ) is introduced to represent and 
reason with the logical form. The approach combines in an elegant way 
linguistic and contextual semantics - both represented in the DL domain 
model. 


15.4 Knowledge-based natural language generation 

In the previous sections an architecture for semantic interpretation was in- 
troduced, where Description Logics were used to build a knowledge base 
with lexical and conceptual information. The knowledge base encodes the 
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necessary data for building the logical form from the analysis of some nat- 
ural language text. In this section we mention another task which makes 
use of the same body of knowledge expressed in a DL-based ontology, but 
with the dual goal of generating a coherent (multi-sentential) natural lan- 
guage text, starting from an abstract non- linguistic specification of its mean- 
ing. Examples are in the context of dialogs (see, e.g., [Stock et al, 1991; 
1993]), of natural language instructions (see, e.g., [Moore and Paris, 1993; 
Di Eugenio, 1994; 1998; Paris and Vander Linden, 1996a; 1996b]), of 
language translation (see, e.g., [Dorr, 1992; Dorr and Voss, 1993; 1995; 
Knight et al, 1995; Quantz and Schmitz, 1994; Wahlster, 2000]), or of multi- 
media presentations (see, e.g., [Wahlster et al, 1993; Andre and Rist, 1995; 
Andre et al, 1996]). 

The lexical and conceptual knowledge base classifier is the main driv- 
ing component for the algorithms used to solve the problem of lexical 
choice, i.e., the task of choosing an appropriate target language term 
in generating text from an underlying logical form [Dorr et al, 1994; 
Stede, 1999]. The lexicalization problem is a non-trivial one, since it is pos- 
sible to have alternative lexical choices covering various (overlapping) parts 
of the content representation - a translation divergence - or it may be nec- 
essary to change the conveyed information content in order to find a viable 
lexical choice - a translation mismatch. The problem is usually solved by 
using ad hoc algorithms which make use of the classifier for determining 
which lexical units can potentially be used to express parts of the logical 
form representing the content. 

The choice and the realization of the most appropriate verbalization 
should be made in the context of the previous utterances (in the case of a 
dialog), of the surrounding environment (in the case of multimedia presen- 
tation), and of the overall goal of the ongoing communicative act. For these 
tasks, it is not enough to have an underlying representation of the content 
of the text to be generated, but a pragmatic aspect has to be considered as 
well. The pragmatic knowledge about the rhetorical interrelationships which 
occur among the various parts of the broader linguistic and extra-linguistic 
context of the communication is needed to generate a coherent presentation 
in agreement with its communicative goals. In other words, on the one hand 
there is the content to be presented, on the other hand there is the style of 
its presentation which should use the most appropriate linguistic expressions 
to convey the message. 

In order to generate a text satisfying the communicative goals and the 
coherence requirements, a planning algorithm is used to generate an overall 
structured text (or discourse) strategy, giving the general shape of the text. 
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Using the lexical and conceptual information in the knowledge base, the 
planner - by taking into account the grammar of the target language - 
converts the text plans into a specialized unambiguous representation of 
the semantic and syntactic information necessary to select the appropriate 
target language terms [Moore and Paris, 1993; Paris and Vander Linden, 
1996b]. 
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Abstract 

In contrast to the relatively complex information that can be expressed in 
DL ABoxes (which we might call knowledge or information), databases and 
other sources such as files, semistructured data, and the World Wide Web 
provide rather simpler data, which must however be managed effectively. 
This chapter surveys the major classes of application of Description Logics 
and their reasoning facilities to the issues of data management, including: 

(i) expressing the conceptual domain model/ontology of the data source, 

(ii) integrating multiple data sources, and (iii) expressing and evaluating 
queries. In each case we utilize the standard properties of Description Log- 
ics, such as the ability to express ontologies at a level closer to that of human 
conceptualization (e.g., representing conceptual schemas), determining con- 
sistency of descriptions (e.g., determining if a query or the integration of 
some schemas is consistent), and automatically classifying descriptions that 
are definitions (e.g., queries are really definitions, so we can classify them 
and determine subsumption between them). 


16.1 Introduction 

According to [ElMasri and Navathe, 1994], a database is a coherent collection 
of related data, which have some “inherent meaning” . Databases are similar 
to knowledge bases because they are usually used to maintain models of 
some universe of discourse (UofD). Of course, the purpose of such computer 
models is to support end-users in finding out things about the world, and 
therefore it is important to maintain an up-to-date and error-free model. 
The main difference between databases and knowledge bases is that while the 
former concentrate on manipulating large and persistent models of relatively 
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simple data, the latter provide more support for inference - finding answers 
about the model which had not been explicitly told to it - and involve fewer 
but more complex data. 

Following the functional view of knowledge bases advocated by Levesque, 
we expect a number of operations that can be applied to the KB, such 
as define, tell, and ask. Each of these operations involves one or more 
languages, such as the schema/constraint language, the update language, 
the query language and the answer language. In an earlier paper surveying 
the application of Description Logics to data management [Borgida, 1995], 
it has been argued that Description Logics offer advantages for each of these 
languages, as well as the internal processing of queries. 

We begin by providing a review of the important notions involving 
databases, their development and use, in preparation for examining the ap- 
plication of Description Logics in these tasks. 

First, one needs to describe the UofD about which the database will be 
knowledgeable. This is a form of requirements specification, which is nor- 
mally undertaken using some high-level language, because the requirements 
will have to be understandable both to end-users and implementors, so they 
can agree on the goals. In databases, the best known such language is the 
Entity-Relationship (ER) data model, 1 but many other so-called semantic 
modeling languages have been proposed [Hull and King, 1987]. The ER data 
model will be described in considerable detail and precision in Section 16.2; 
for now, suffice it to say that it views the world as populated by entities, 
which are related to each other by n-ary relationships, and are described by 
attributes having atomic values. Note that a semantic model may be con- 
cerned with the UofD as well as the data to be stored in the computer, and 
consists of mostly time-invariant generic information (e.g., “every depart- 
ment has exactly one manager”) as opposed to specific facts (e.g., “Edna 
manages the shipping department”). The semantic model introduces the 
terms to be used in talking about the domain, and captures their meaning 
by their interrelationships and constraints on them. 

From this generic description of the UofD, the database designer devel- 
ops a logical schema, describing the structure of data stored in the database, 
including the data types, interconnections, and constraints that must hold. 
Different data models are used for this purpose, but the relational data 
model has become the logical model of choice. While in the semantic mod- 
eling phase the emphasis was on a natural and direct mapping to the UofD, 
in this case the driving force is the existence of large software systems called 


1 The term “data model” refers to a language or set of concepts for describing a class of databases. 
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Database Management Systems (DBMSs), which support the management 
of the data in the model. For example, the relational data model views data 
as being stored in the form of tables/relations, with rows/tuples contain- 
ing primitive data types (e.g., integers, strings). In this case, the schema 
contains, among other things, the name of each table, with its columns 
and their datatype. For example, table Supplies may have columns for the 
material, the supplier, the recipient, as well as the shipment date and the 
amount of material supplied. Relational DBMSs require that each table be 
given a subset of attributes (called a “key”) which uniquely identifies each 
tuple. DBMSs may offer additional ways to capture integrity constraints - 
assertions distinguishing valid from invalid states of the data. 

More recently, Object-Oriented DBMSs have been developed. These sup- 
port the management of persistent objects with intrinsic identity, which can 
be related to (collections of) other objects, not just atomic values. Such 
OO-DBMSs can be used, among other things, for providing persistence 
for object-oriented languages. Object-oriented languages and databases also 
support the notion of “method” or procedure attached to a class, as well as 
implementation encapsulation, but these aspects will not be considered in 
this chapter. 

The database is used of course to store facts about the (current) state of 
the world. Databases make the so-called “closed-world assumption”, which 
states that a fact is false unless it has been explicitly stated as true. This 
assumption works well, with the restriction that the database represents 
only a very limited form of partial information. In particular, databases do 
not allow the representation of disjunctive information, and support only 
a very limited form of existential quantification: if there is no information 
about an attribute, it is given the null value. 

In order to provide access to the data stored in databases, DBMSs support 
a variety of query languages - languages for specifying declaratively what 
data is to be retrieved. For relational databases, SQL is the practical query 
language of choice. However, from the theoretical point of view, first-order 
logic formulae with free variables are a much more elegant form, based on 
the observation that tables can be viewed as predicates. For example, 

3m, dl, d2. supplies^intel 7 , r, m, dl) A supplies^intel 7 , r, m, d2) A (d2 / dl) 

would be asking for recipients (values of the free variable r) who had received 
from 'intel 7 shipments of the same material (m) on different dates (dl, d2). 

Query languages of varying expressive power can be obtained by restrict- 
ing or extending the above “standard”. For example, the so-called “conjunc- 
tive” or “select-join-project” queries only allow formulae with existential 
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quantifiers and conjunction, while Datalog is a query language that permits 
the use of intermediate tables derived using Horn rules, and thereby sup- 
ports recursion [Ullman, 1988]. For example, if we want to describe when 
one company depends on another through a chain of suppliers, we could 
state the rules 2 

dependsOn(x, y) <— supplies(x, y, m, d). 

dependsOn(x, y) <— supplies(:r, z, m, d, a) A dependsOn(z, y, m2, d,2, 02). 

In many DBMSs, the result of a query is another structure of the kind 
found in the schema (e.g., relational queries return tables as answers). In 
some situations, either because a query is asked frequently or because we 
want to restrict the access of some users to a subset of the database, a query 
can be named, in which case it is called a view. If a view is materialized, 
then its value is stored rather than re-computed on demand, and it is kept 
correct after every update to the basic database. 

The DBMS performs a number of hidden functions, insulating users from 
the considerable details of the physical level. For example, the DBMS places 
the incoming data physically onto storage media, and provides data struc- 
tures and other information that permits efficient access to certain data at 
some later point of time. In particular, given a query, the DBMS attempts 
to optimize the time in which it is answered by looking at access structures 
available and statistical information, and using the ability to reformulate 
queries into other, equivalent ones. 

Over time, additional, more complex kinds of databases and DBMSs have 
appeared. For example, distributed databases keep information at a variety 
of sites connected by networks (e.g., so that data might be closer to where it 
is used most frequently). Note however that the user is unaware of this de- 
tail, and perceives a single database. Heterogeneous and federated databases 
are collections of independent databases which choose to share information 
but are maintained autonomously. Users may even be interested in obtaining 
information from all kinds of sources, including non-databases such as files, 
etc. In such situations, a significant problem is relating the logical schemas 
at the various sites in order to provide a schema that can be presented to 
the user. The rest of the chapter is devoted to showing a variety of roles 
that Description Logics (and reasoning with them) can play in database 
management. In particular, in Section 16.2 we take a detailed look at their 
use in semantic/conceptual modeling. We then examine the possible uses of 
Description Logics in querying and query processing in Section 16.3, while 

2 Variables appearing only on the right hand side of >” are assumed to be existentially quan- 

tified. 
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in Section 16.4 we will consider the utility of Description Logics in provid- 
ing integrated access to multiple information sources. We summarize the 
material in Section 16.5. 


16.2 Data models and Description Logics 

Recall that a “data model” is essentially a language or set of concepts for 
describing a class of certain kinds of databases. This section attempts to 
answer some questions about the relationship between data models and De- 
scription Logics: 

What are some examples of such relationships? First, we will con- 
sider in detail the translation of Entity-Relationship models into 
knowledge bases expressed in the VCR Description Logic. In Subsec- 
tion 16.2.5, we will consider more cursorily several other data models, 
such as OODB and semistructured data. 

How are relationships established? The answer is by (i) formalizing 
the data model (ER in this case), (ii) choosing an appropriate De- 
scription Logic ( VCR in this case), (iii) defining a translation function 
from the former to the latter, and (iv) proving that this translation 
is “information-preserving” (not done here, but detailed in [Calvanese 
et ai, 1999e]). 

What benefits can be derived from having established relation- 
ships? Most significant is the use of automated DL reasoning services 
to support the development and maintenance of correct models (Sub- 
section 16.2.4). In addition, since Description Logics are often more 
expressive, it is possible to suggest extensions to database data models 
that allow further information about the structure of the data to be 
captured (Subsection 16.2.3). 


16.2.1 The Entity-Relationship model 

In order to talk about the relationship between the Entity-Relationship 
(ER) model and Description Logics, it is necessary first to introduce the 
reader to the ER data model (see also Subsection 4.3.1 and Chapter 10). ER 
is the most widespread semantic data model, and it has become a standard, 
extensively used in the design phase of commercial applications. The ER 
model was introduced in [Chen, 1976], with minor variants and extensions 
proposed over the years (e.g., [Teorey, 1989; Batini et ai, 1992; Thalheim, 
1992; 1993]). 
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The basic elements of the ER model are entities, relationships, and at- 
tributes. An entity set (or simply entity) denotes a set of objects, called its 
instances, that have common properties. Elementary properties are mod- 
eled through attributes, whose values belong to one of several pre-defined 
domains, such as Integer, String, or Boolean. Properties that are due to rela- 
tions to other entities are modeled through the participation of the entity in 
relationships. A relationship set (or simply relation ) denotes a set of tuples 
(also called its instances), each of which represents an association among a 
different combination of instances of the entities that participate in the rela- 
tionship. Since each entity can participate in a relationship more than once 
(e.g., a company can be the recipient or sender in a “supply” relationship), 
the notion of ER-role is introduced to represent such a participation, and a 
distinguishing identifier within the relationship is assigned to it. The arity 
of a relationship is the number of its ER-roles. We assume that, for each re- 
lationship of arity n, the identifiers 1 , ... ,n are assigned to the roles of the 
relationship. 

An entity B is said to be a specialization or IS-A of another entity A, if 
all the instances of B are also instances of A. Relationships can be similarly 
related by IS-A. This induces an inheritance of the attributes of an entity 
to its subentities, and of the roles of a relationship to its subroles. The 
ER schema produced as a result of ER modeling is usually represented in 
a graphical notation, which is particularly useful for an easy visualization 
of the data dependencies. In the commonly accepted notation, entities are 
represented as boxes, whereas relationships are represented as diamonds. 
An attribute is shown as a circle attached to the entity for which it is 
defined. ER-roles are graphically depicted by connecting the relationship to 
the participating entities, and labeling the edges with the corresponding role 
identifier. An IS-A relation between two entities is denoted by an arrow from 
the more specific to the more general entity (analogously for IS-A relations 
between two relationships). Cardinality constraints can be attached to an 
ER-role in order to restrict the number of times each instance of an entity 
is allowed to participate via that ER-role in instances of the relationship. 

Such constraints can be used to specify both existence dependencies and 
functionality of relations [Cosmadakis and Kanellakis, 1986]. They are often 
used only in a restricted form, where the minimum cardinality is either 0 or 
1 and the maximum cardinality is either 1 or oo. Cardinality constraints in 
the form considered here have already been introduced in [Abrial, 1974], and 
subsequently studied in [Grant and Minker, 1984; Lenzerini and Nobili, 1990; 
Ferg, 1991; Ye et al, 1994; Thalheim, 1992; Calvanese and Lenzerini, 1994b]. 
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Fig. 16.1. Example of an ER schema. 


An example of an ER schema is reported in Figure 16.1. Such a schema 
models information, handled by an enterprise, about contracts between cus- 
tomers and departments for services, and about registration of customers 
at departments. Some customers may be registered at “promotion depart- 
ments” . 

For the purpose of relating the ER model to Description Logics it is bet- 
ter to have a more formal description, which also abstracts out the most 
important common characteristics present in the different variants. 

An ER schema S is constructed starting from pairwise disjoint sets of 
entity symbols, relationship symbols, ER-role symbols, attribute symbols, 
and domain symbols. Each domain symbol D has an associated predefined 
basic domain D Bt> , and we assume the basic domains to be pairwise disjoint. 
For each entity symbol, a set of attribute symbols is defined, and to each 
such attribute a unique domain symbol is associated. A relationship symbol 
of arity n has n associated ER-role symbols, each with an associated entity 
symbol, and defines a relationship between these entities. We assume that 
each ER-role symbol belongs to a unique relationship, thus determining also 
a unique entity. The cardinality constraints are represented by two functions 
cmins, from ER-role symbols to nonnegative integers, and cmaxs, from ER- 
role symbols to positive integers union the special symbol oo. IS- A relations 
between entities and between relationships are modeled by means of a binary 
relation fis- We do not need to make any special assumption on the form 
of As, such as acyclicity or injectivity. 

The semantics of an ER schema can be given by specifying which database 
states are consistent with the information structure represented by the 
schema. Formally, a database state B corresponding to an ER schema S 
is constituted by a nonempty finite set A , assumed to be disjoint from all 
basic domains, and a function ® that maps 
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• every domain symbol D to the corresponding basic domain D Bt> , 

• every entity E to a subset E B of A B , 

• every attribute A to a set A B C A B x Uzxe'D.s D Bt> , and 

• every relationship R to a set R B of labeled tuples over 

A labeled tuple over a domain is a function from a set of ER-roles to 
A e . The labeled tuple T that maps ER-role Ui to Oj, for i £ {1, . . . , n}, is 
denoted {Up. oi, . . . , U n : o n ). We also write T[[/,] to denote Oj, and call it the 
U i-component of T . The elements of E°, A , and are called instances of 
E, A, and R respectively. 

A database state is considered acceptable if it satisfies all integrity con- 
straints that are part of the schema. This is captured by the notion of legal 
database state. A database state B is legal for an ER schema 5, if it satisfies 
the following conditions: 

• For each pair of entities E\,E 2 with E\ A 5 E 2 , we have E B C E B . 

• For each pair of relationships with R\ As i? 2 , we have R B C R B . 

• For each entity E, if E has an attribute A with domain D , then for each instance 
e £ E B there is exactly one element a £ A B with e as first component, and the 
second component of a is an element of D Bt> . 

• For each relationship R of arity n between entities E\, . . . , E n , to which R is 
connected by means of ER-roles Ui,...,U n respectively, all instances of R are of 
the form {Up. ei, . . . , U n : e n ), where e* £ E B , i £ {1, . . . , n}. 

• For each ER.-role U of relationship R associated with entity E, and for each 
instance e of E , we have 

cmins{U) < |{r € R B \ r[U ] = e}\ < cmaxs{U). 


16.2.2 Transforming Entity-Relationship schemas into VCR 

knowledge bases 

In order to represent ER schemas in terms of DL knowledge bases, we make 
use of the Description Logic VCR , which has been formally introduced in 
Chapter 5. We recall here the syntax of VCR , which is a natural generaliza- 
tion of Description Logics to n-ary relations: in particular, atomic relations, 
of given arity between 2 and n max , belong to the basic elements of VCR , and, 
besides concept expressions, arbitrary relation expressions can be formed, 
according to the following syntax: 

R, := T n | P | ( %i/n:C ) | — 'R, | Rj n R 2 

C := Ti | A | -iC | CinC 2 | 3[$z]R | stfc[$i]R 
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where P and R denote respectively atomic and arbitrary relations, i denotes 
components of relations, i.e., integers between 1 and n max , n denotes the 
arity of a relation, i.e., an integer between 2 and n max , and k denotes a 
nonnegative integer. In what follows, we abbreviate (%i/n: C ) by ($i: C ) when 
n is clear from the context. Moreover, we use the following abbreviations: 

V[$i]R for -i3[$i]-iR 

^(fc+l)[$i]R for -i(^fc[$i]R) 

= k [$i]R for (<(fc+l)[$i]R)n(>)fe[$i]R). 

In VCR , n-ary relations are interpreted as sets of tuples of arity n, and the 
VCR constructs generalize those of traditional Description Logics. In partic- 
ular, besides the Boolean constructs on concepts and relations, the construct 
($i/n: C ) denotes all tuples of arity n in which the zth component is an in- 
stance of concept C , and thus represents a unary selection. The construct 
3[$i]R denotes all objects that participate as the ith component in a tu- 
ple of relation R, and thus represents a unary projection. Finally ^ k [$i]R 
is a generalization of number restrictions to n-ary relations. We refer to 
Chapter 5, Section 5.7, for the formal semantics of the VCR constructs. 

We now show that the semantics of the ER model can be captured in VCR 
by defining a translation </> from ER schemas to VCR knowledge bases, and 
then establishing a correspondence between legal database states and models 
of the derived knowledge base. In the following, for each relationship R of 
arity n in S, we denote by /j,r a mapping from the set of ER-roles associated 
with R to the integers 1 , ,n. 

The knowledge base 4>(S) derived from an ER schema S is defined as 
follows: 

• The set of atomic concepts of consists of the set of entity and domain symbols 

in S: 3 

• The set of atomic relations of is obtained from the set of relationship and 
attribute symbols in S. More specifically: 

— each symbol R in S , denoting a relation of arity n, is mapped into a symbol 

in <f>(S), denoting a relation of arity n. 

— each attribute symbol A in S is mapped into a symbol P .4 in denoting 

a relation of arity 2. Thus, each instance of the relation P^ is a tuple such 
that its first component corresponds to an entity, while the second component 
denotes an element of the concept corresponding to the attribute domain. 

• The set of inclusion axioms of consists of the following elements: 


3 For the sake of simplicity, we model domains of ER schemas as concepts in T>C1Z. 
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— for each pair of entities E\, E 2 such that E i Es E 2 , the inclusion axiom 

E\ 13 E 2 

— for each pair of relationships R \ , R 2 such that R\ Es R 2 , the inclusion axiom 

Pfli E P r 2 

— for each attribute A with domain D of an entity E, the inclusion axiom 

E E (V[$l](P A n($2:£>)))n = l[$l]P A 

— for each relationship R of arity n with ER-roles Ui, ... ,U n in which each Ui is 
associated with the entity Ei, the inclusion axiom 

Pji E ($ti R (Ui):E 1 )n---n($n R (U n ):E n ) 

— for each ER-role U of relationship R associated with entity E, with cardinality 
constraints to = cmins(U) and n = cmaxs(U), 

* if to y^ 0, the inclusion axiom 

E C ^tm[$n R (U)]P R 

* if n y^ 00 , the inclusion axiom 

E E <n[$/ifl(J7)]P fl . 

Based on the results presented in [Calvanese et al, 1999e], the correctness 
of the translation presented above can be formally proved. More specifically; 
let S be an ER schema. Then, there is a one-to-one correspondence between 
legal database states of S and models of the VC1Z knowledge base cf{S ). 
For example, an entity E can be populated in a legal database state for S 
if and only if <f>(S) admits a model in which E has a nonempty extension. 
This allows us to exploit reasoning techniques developed for the logic VCTZ 
in order to reason on ER schemas. 

For example, by applying the translation presented above to the ER 
schema in Figure 16.1, presented earlier, we obtain the following VCfZ knowl- 
edge base: 

CONTRACT C ($1: Client) n ($2: Service) n ($3: Department) 

REG— AT C (Si: Client) n ($2: Department) 

PROMOTION E REG-AT n ($2: PrDept) 

Department E V[$l](LOCATION n ($2: String)) n = 1 [$l]LOCATION 
PrDept E Department. 
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16.2.3 Additions to the Entity Relationship model 

The ER model does not provide several features which would prove useful 
in order to represent complex dependencies between data. On the other 
hand, the richness of constructs that is typical of Description Logics, and 
the correspondence between the two formalisms established in the previous 
subsection, makes it possible to add such constructs to the basic model and 
take them fully into account when reasoning on a schema. We provide several 
examples of useful additions to the basic ER Model that arise as a natural 
consequence of the correspondence with the Description Logic VCR. We 
also consider a feature of the original ER Model that appears to force VCR 
itself to be extended. 

• Arbitrary Boolean constructs on entities. The only direct relationship between 
entities that can be expressed in the basic ER model is the IS-A relation. A 
common extension is by so-called generalization hierarchies (see e.g., [Batini et al., 
1992]), which allow one to express that the extension of an entity should be the 
disjoint union of the extensions of other entities. Such construct can easily be 
translated by making use of union and negation of VCR. 

• Refinement of properties along an IS-A hierarchy. Another important extension 
that should be considered is the ability to specify more complex forms of refine- 
ment of properties of entities along IS-A hierarchies, than the mere addition of 
attributes. This is already an essential feature of the more recent object-oriented 
models. In particular, cardinality constraints could be refined by restricting the 
range of values, and the participation in relationships can be restricted. One may 
require for specific instances of an entity that the objects they are related to via 
a certain relationship belong to a more specific entity than the one directly as- 
sociated to the ER.-role. Such forms of constraints can be naturally expressed in 
VCR by making use of universal quantification over relations. 

• Definitions of classes by means of complex properties. In the ER model (and more 
generally in semantic data models) one can specify only necessary conditions that 
the instances of entities (or more generally classes) must satisfy. This means that 
in a database that conforms to the schema one cannot deduce that a certain object 
is an instance of an entity unless this fact is explicitly stated. When modeling 
a complex domain, however, in order to capture more precisely the intended 
semantics, one would like to be able to define classes of objects through necessary 
and sufficient conditions, or even to state just sufficient conditions for an object 
to be an instance of a class. The former correspond in fact to views, which are 
important parts of database schemas. By using the different types of axioms of 
VCR , necessary and sufficient (and even just sufficient) conditions can be easily 
imposed and become part of the schema. 

• Key constraints. Because of their utility in physical database design, even the 
original ER model allowed the specification of key attributes/roles. Extending 
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Description Logics with key constraints (roles which uniquely identify objects) 
has been the subject of several investigations [Borgida and Weddell, 1997]. In 
particular, Calvanese et al. [2000b] have shown that reasoning about VCR aug- 
mented by key constraints can be performed without increasing the worst-case 
computational complexity. 

• Temporal constraints. Recent efforts in the conceptual modeling community have 
been devoted to properly capturing time- varying information, and several propos- 
als of temporally enhanced Entity-Relationship (ER) exist. [Artale and Franconi, 
1999; 2001; Artale et al., 2001] provide a DL-based logical formalization of the 
various properties that characterize and extend different temporal ER models 
which are found in literature. In particular, [Artale et al., 2001] define the De- 
scription Logic VCRuSi an extension of VCR with temporal constructs, and 
study decidability and complexity of reasoning in such a logic. 


16. 2. 4 Reasoning about Entity-Relationship schemas 

Providing a formalization of the ER schema in terms of the logic VCR 

allows several forms of reasoning on the ER schema to be supported. Typical 

reasoning tasks at the conceptual level supporting the designer of an ER 

schema S (see [Calvanese et al., 1998e]) include: 

• Entity satisfiability, i.e. , whether for every concept C, S admits a model in which 
it has a nonempty extension. If C must always have an empty extension then 
there is an inconsistency in its specification, or at the very least the concept is 
inappropriately named since it is a synonym for “EmptyEntity”. 

• Relation satisfiability, i.e., whether S admits a model in which a certain relation 
has a nonempty extension. (Similar to the above.) 

• Consistency of the ER schema, i.e., whether S admits a finite model. Without 
this, there is no database that satisfies the schema, which indicates that the to- 
tality of the definitions is inconsistent or requires an infinite model, which is 
a clear sign of incorrectness. Ideally, the reasoning system could provide expla- 
nations [McGuinness and Borgida, 1995; Borgida et al, 2000] for the source of 
inconsistencies, which could focus the search for modifications. 

• Redundancy of the ER schema. Various forms of redundancy in the ER schema 
can be detected: e.g., if A, B are entities and both AC B and B C A hold, we 
can conclude that one of the entities is redundant. 

• Stronger constraints on relationship roles. The concept and relationship specifica- 
tions may combine to yield stronger cardinality or domain constraints than those 
explicitly specified by the designer. (The simplest example is when we permit 
(multiple) inheritance.) 

• Entity subsumption, i.e., whether the extension of one concept B is a subset of 
the extension of another concept A in every model of S. This property suggests 
that the designer check for the possible omission of an explicit IS-A relationship 
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between B and A. Alternatively, if conceptually all B’s are not supposed to be 
A’s, then something is wrong in the rest of the schema, since it is forcing an 
undesired conclusion. 

• Relation subsumption, i.e., whether the extension of one relation is a subset of 
the extension of another relation in every model of S. (Similar to the above.) 

Ideas such as the ones above have been been pursued, for example, within 
the DWQ European Project [Bouzeghoub et al . , 1999], where the DL sys- 
tem FaCT [Horrocks, 1998b] has been successfully used as reasoning tool 
supporting the analysis and the integration of diverse database conceptual 
schemas [Franconi and Ng, 2000]. 


16.2.5 Description Logics and other data models 

Several other investigations have been carried out on the relationships be- 
tween Description Logics and database models: 

• [Bergamaschi and Nebel, 1994; Artale et al, 1996a; Calvanese et al . , 1999e] pro- 
vide formal models of object-oriented DBMSs using Description Logics. 

• [Borgida et al, 1989; Beck et al., 1989; Bergamaschi and Sartori, 1992] introduce 
semantic data models based directly on Description Logics, which are different 
from ER and previous database semantic data models. 

• More generally, class-based knowledge representation schemes, such as semantic 
networks, conceptual structures and frames [Lehmann, 1992; Sowa, 1991] have 
been considered as database models, or as ways to enrich the deductive capabilities 
of data models. These are related to Description Logics as suggested in Chapter 4. 

A recent important development in the field of data management has 
been the need to represent data whose structure is less rigid and strict than 
that held in conventional databases. Such semistructured data are impor- 
tant in many application areas, such as web information systems, biological 
databases, and digital libraries. Semistructured data is neither raw text, 
nor strictly typed as in conventional database systems [Abiteboul, 1997]. 
In many recent formalisms, semistructured data is modeled by graphs with 
labeled edges, where the label keeps information on both the values and the 
schema of the data. Many authors have noticed that this model coincides 
with the ontology of Description Logics, where roles correspond to edges. In 
[Calvanese et al., 1998c] it is shown that expressive Description Logics can 
not only capture semistructured data schemas, but can also add the ability 
to express several new kinds of constraints. The same kind of investigation 
has been carried out in [Calvanese et al., 1999d] for the case of the XML 
language, which is currently a very popular formalism for semistructured 
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data on the web (see Chapter 4, Subsection 4.3.3 for more details). 


16.3 Description Logics and database querying 

We have seen that descriptions can be used to present the schema of a 
database. For example, to emulate object-oriented databases, classes are 
equated with primitive concepts, while type restrictions on attributes are 
presented as necessary conditions that apply to these primitive classes in 
the form of role restrictions. In addition, certain integrity constraints can 
be expressed as rules of the form “if C then D”, or axioms CCD. On the 
other hand, since a concept description provides necessary and sufficient 
conditions for objects to satisfy it, it is natural to treat it as a query. So, 
in systems like Classic [Borgida et al., 1989] and Candide [Beck et al, 
1989], we have a unification of two traditionally distinct languages: the data 
definition and data manipulation languages. 


16.3.1 Description Logics as query languages 

Once the query is viewed as a concept description, we can perform the stan- 
dard operations on it. For example, the query description can be compared 
to the inconsistent description. If they are equivalent, this is almost surely 
a mistake on the part of the user - who would want to ask a query that 
never returns an object? The most likely reason for this is that the per- 
son asking the query is unfamiliar with the application domain. Since the 
query can be quite complex, and the schema quite large, a really helpful 
system would then assist the user in understanding the problem by isolat- 
ing the specific parts of the query and of the schema that are responsible 
for the contradiction. Such a tool can be built on top of explanation facili- 
ties available for certain Description Logics [McGuinness and Borgida, 1995; 
Borgida et al . , 2000]. 

More generally, in situations where the query returns no individuals in the 
current database, it has been argued that the query is “not interesting”, and 
should be generalized until a non-empty answer set is returned. As suggested 
by Anwar et al. [1992], this relaxation can be performed using the semilattice 
of descriptions provided by the subsumption relationship, which can guide 
the systematic weakening of terms in the query. 

The query can be classified with respect to the concepts in the schema. 
This can be used to help users pose queries in an unfamiliar domain, as 
follows: if the answer set contains unwanted values, the immediate subsumers 
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and subsumees of the query reveal other potentially relevant concepts (and, 
through subsumption assertions in the schema, roles as well) which the user 
may want to use for restricting the query. The result is a process of query 
specification by iterative refinement introduced by Tou et al. [1982] . 

Queries can also be classified with respect to each other into a sub- 
sumption hierarchy. In an environment where several people are asking ex- 
ploratory questions about the data over a long period of time (e.g., data 
mining by humans), it is very useful to have the questions organized so that 
the results of previous related queries can be reviewed [Brachman et al., 
1992], This prevents duplication of effort and, again, helps the user to pose 
queries that are more precise. 

Unfortunately, in exchange for a more expressive description of the 
schema, Description Logics pay the price of a weaker query language than 
usual: queries can only return subsets of existing objects, rather than creat- 
ing new objects (as in standard SQL databases); furthermore, the selection 
conditions are rather limited. In fact, it has been shown [Borgida, 1996] that 
even the most expressive Description Logics discussed in the literature until 
recently, could only express a variant of the “3-variable” subset of formu- 
lae of first-order logic - i.e., formulae that only use 3 variables, although 
allowing numeric quantifiers, like “exists at least n”. 

Given the expressive limitations of DL concepts alone as queries, it is 
reasonable to consider extending standard queries (in Datalog) with De- 
scription Logics. Two different approaches have been pursued: In one, in- 
spired by the work of Ai't-Kaci and Nasr [1986] on Login, and exem- 
plified by the AC- LOG language [Donini et al., 1998b], descriptions are 
used essentially as type constraints on variables appearing in Horn clauses. 
In this case, a crucial condition is that concept and role names form 
a disjoint set from the relations used in expressing rules. The second 
approach, exemplified by the Carin language [Levy and Rousset, 1996; 
1998], treats concepts and roles as ordinary unary and binary predicates 
that can also appear in query atoms. This is significant because it allows for 
the first time conjunctive queries to be expressed over DL databases/Aboxes. 

A second important distinction is between recursive and non-recursive 
Datalog queries. For the non-recursive case (which covers a large portion 
of practically useful queries), it seems possible to combine some expressive 
decidable Description Logics with Datalog, while keeping query answering 
and even reasoning on queries decidable (see Section 16.4). For the recursive 
case, undecidability arises sooner, but some studies have identified suitable 
restrictions on the DL language and/or on the form of Datalog rules, for 
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preserving decidability of query answering. 

Consider first .4/Z-log. In the rule 

happy(x) <— marriedTo(x, y) A employedBy(y, z) 

& Person(x) A Person(?/) A StartUp(2;) 

the tests after the ampersand & are for concept membership, while those 
before it are for n- ary relations, as in relational databases. The processing 
of such queries is complicated by the fact that the DL “type database” 
may contain disjunction or be otherwise incomplete. Instead of the standard 
answers, one gets a “conditional result”, with a side condition c describing 
necessary DL constraints on the variables in the query. For example, for the 
above query one might get as answer 

happy(ANNA) if Person(ANNA) 

in a database containing 

marriedTo(ANNA, JOE), employedBy(JOE, IBM), Person(JOE), StartUp(IBM). 

Donini et al. [1998b] establish that answering queries in recursive AC- 
LOG is decidable in the case when the Description Logic used is ACC. The 
framework of .4£-log is further extended in [Rosati, 1999] to the case of 
disjunctive Datalog, i.e., Datalog with negation as failure in rule bodies and 
disjunction in the head of rules. 

The Carin approach is more general, but this increase in expressive power 
comes at a price: for general Datalog rules, the query answering problem is 
now undecidable as soon as one allows MR.C or ^ nR as concept construc- 
tors. (These appear in most Description Logics.) However, if Datalog rules 
are restricted to avoid recursion, then query answering is decidable even 
for the ACCNlZ Description Logic. Numerous other results circumscribing 
the cases when query processing is decidable may be found in [Levy and 
Rousset, 1998]. 


16.3.2 Query optimization 

In the case when queries can be classified (as when they are descriptions or 
when the query implication problem is decidable), classification of queries 
has been proposed as a technique for query processing and optimization. In 
[Beck et al., 1989], queries are classified with respect to schema concepts; 
if the query concept Q is classified below concept C, then only instances 
of C need to be checked whether they satisfy the full query. Of course, in 
this classification process one uses the axioms describing the schema of the 
database. 
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If the answers to previous queries are cached, then the query concepts 
can be left in the classification hierarchy, together with the other concepts 
in the schema. The result is a simple form of the query optimization tech- 
nique known as “query answering using cached views” : find the most specific 
views V that subsume the query Q; check only the individual instances of 
V (which, recall, are locally available) to see if they satisfy the query. Po- 
tentially, this could provide considerable savings, especially when gathering 
information from multiple sites, for example. 

Buchheit et al. [1994b] elaborate on this by using a more powerful query 
language. In particular, in order to achieve the expressiveness of full first- 
order logic, expressing a query is viewed as a two-phase process: as much of 
the query as possible is written in the “query Description Logic” (yielding 
the so-called “structural part”), and the remainder of the query is written 
as a constraint in a first-order logic notation (yielding the so-called “dirty 
part”). For example, the following query asks for students whose advisor 
is the same as their committee chair and at least 5 years older than the 
student: 

QueryClass QueryStudent isa Student with 
derived 

11 : advisor: Prof 

12 : committee. (chair: Thing) 

where 11=12 constraint forall s/QueryStudent (s.age + 5 < s. advisor. age). 

In this case, assuming that cached views have only structural conditions, 
the query is classified using only its own structural conditions. Thereafter, 
only the instances of the view are tested using both the structural and dirty 
parts of the query. 

Finally, Bergamaschi et al. [1997] have investigated the use of Descrip- 
tion Logics in optimizing query evaluation in object-oriented DBMSs by 
eliminating redundant terms. This is accomplished by first expanding the 
query as much as possible using the information in the schema; for example, 
subsumption is used to test when the antecedent of a rule can be applied 
to the query (subsumes it) so that its consequent can be added to it. By 
repeatedly applying this process, an expanded query is obtained. Then, all 
the query subterms that subsume the rest of the query (and are therefore 
redundant) are eliminated one by one. The result is a semantically equiv- 
alent description/query which may be more concise than the original one; 
hence it may have fewer tests to evaluate. Furthermore, the new expanded 
query may be classified further down the pre-existing class/view hierarchy, 
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providing more efficient query evaluation, using the query classification 
technique described earlier. These are forms of so-called “semantic query 
optimization” . 

An issue related to efficient processing of large numbers of individuals 
is the situation where the user needs to query the conceptual model for 
DL instances, while the data are presented in a relational database, say. In 
other words, we need to obtain the proper ABox instances of the DL query 
(which involves concepts and roles) from the database. The main problem 
is that processing hundreds of thousands of individuals is not feasible with 
DL technology because in each case we try to perform complex inferences. 
However, most of the data in the database is very straightforward, and the 
corresponding individuals do not generate new inferences. The solution pro- 
posed in [Borgida and Brachman, 1993] is to associate with the primitive 
concepts (resp. roles) of the DL knowledge base unary (resp. binary) view 
tables defined over the DBMS. One can then automatically translate com- 
plex descriptions into complex SQL queries over these views. The important 
effect is that one gets the full benefit of DBMS optimization for the SQL 
query, and if only a few values satisfy the query, then only a few DL indi- 
viduals need to be created. For example, for a primitive DL class Student, 
we might take the values appearing in the enrol lee column of relational table 
Enrollment.R, and use this subset of the Person.R table to generate appro- 
priate individuals in a special view Student.R, which has only one column. 
(The generation of unique identifiers for these individuals is in itself a re- 
search issue.) Similarly, for example, one would generate a two-column view 
visitor_R corresponding to the role visitor. Complex descriptions over Student 
and visitor are then translated algorithmically into SQL queries over the cor- 
responding views. Additional optimizations turn out to be necessary to deal 
properly with multiple queries and functional roles [Borgida and Brachman, 
1993]. 


16.4 Data integration 

Integrating different data sources is one of the fundamental problems faced in 
the last decades by the database community [Batini et ai, 1986]. Generally 
speaking, the goal of a data integration system is to provide a uniform 
interface to various data sources [Levy, 2000], so as to enable users to focus 
on specifying what they want. As a result, the data integration system frees 
the users from tasks such as finding the relevant data sources, interacting 
with each source in isolation, and selecting, cleaning, and combining data 
from multiple sources. 
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The design of a data integration system is a very complex task, which 
comprises several different aspects. Our goal in this section is to discuss the 
use of Description Logics in two important aspects, namely: 

• The specification of the content of the various data sources. 

• The process of computing the answer to queries posed to the data integration 
system, based on the specification of the sources. 


16. 4-1 Specifying the content of data sources 

The typical architecture of a data integration system allows one to explicitly 
model data and information needs - i.e. , a specification of the data that the 
system provides to the user - at various levels: 

• The conceptual level contains a conceptual representation of the sources and of 
the reconciled integrated data, together with an explicit declarative account of 
the relationships among their components. 

• The logical level contains a representation of the sources in terms of a logical data 
model. 

The conceptual level As we have seen before, the conceptual level con- 
tains a formal description of the concepts, the relationships between con- 
cepts, and the information requirements that the integration application 
has to deal with. The key feature of this level is that such a description is 
independent of any system consideration, and is oriented towards the goal 
of expressing the semantics of the application. In particular, we distinguish 
among the following elements: 

• The Enterprise Conceptual Schema is a representation of the global concepts and 
relationships that are of interest to the application. It corresponds roughly to 
the notion of global conceptual schema in the traditional approaches to schema 
integration and to the notion of world view , as introduced in [Levy et al, 1995; 
Kirk et ah, 1995]. 

• For an information source S, the Source Conceptual Schema of S' is a conceptual 
representation of the data residing in S. 

• The term Domain Conceptual Schema is used to denote the union of both the 
Enterprise Conceptual Schema and the various Source Conceptual Schemas, plus 
possible inter-schema relationships [Catarci and Lenzerini, 1993]. 

We have seen in Section 16.2 that Description Logics are very well suited 
for data modeling at the conceptual level, so it comes as no surprise that De- 
scription Logics have also been used in data integration projects to represent 
Source and Enterprise Conceptual Schemas [Catarci and Lenzerini, 1993; 
Arens et al., 1993; 1996; Levy et al., 1995; Goasdoue et ah, 2000]. In this 
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section, following [Calvanese et al, 1998e], we will continue to use the VC1Z 
DL for specifying these conceptual schemas. 

As stated above, the Domain Conceptual Schema contains inter-schema 
relationships. In particular, since the sources are of interest in the system, 
integration does not simply mean producing the Enterprise Conceptual 
Schema, but rather being able to establish the correct interdependencies 
both between the Source Conceptual Schemas and the Enterprise Concep- 
tual Schema, and between the various Source Conceptual Schemas. 

To specify inter-schema relationships, we make use of the special kinds of 
assertions available in DL reasoning. In particular, following [Catarci and 
Lenzerini, 1993], one can use assertions of the following forms: 

L% E ex t Lj 
Lj C int Lj 

where Li and Lj are expressions of different schemas. In particular, Lj and 
Lj are either two relation expressions of the same arity, or two concept 
expressions. Intuitively, the first assertion states that Lj is extensionally 
included in Lj, which means that every object that satisfies the expression 
Lj in source i also satisfies the expression Lj in source j. For example, if 
the designer knows that the set of students stored in source 1 is a subset of 
those stored in source 2, then this knowledge is captured by the inter-schema 
assertion 


Studenti C ea , t Student2- 

The second assertion states that the concept denoted by the expression Lj 
in source i is a subconcept of the one denoted by the expression Lj in source 
j, which means that every object in source i satisfying Lj also satisfies Lj 
in source j, provided that it does appear in source j. For example, if the 
designer knows that the concept of student in source 1 is a subconcept of 
person in source 2, then s/he can use the inter-schema assertion 

Studenti L int Person2- 

It is worth noting that the possibility of reasoning about VCTZ schemas 
allows sophisticated forms of reasoning on inter-schema assertions, e.g., for 
inferring those extensional relationships between concepts that are implied 
by the knowledge on the intensional interdependencies. More details about 
these forms of reasoning can be found in [Catarci and Lenzerini, 1993; 
Calvanese et al., 1998e]. 



520 


A. Borgida, M. Lenzerini, and R. Rosati 


The logical level The logical level provides a description of the logical 
content of each source, called the Source Schema. Typically, a Source Schema 
is provided in terms of a set of relations using the relational logical model of 
data. So-called wrappers can be used to hide how the source actually stores 
its data, the data model it adopts, etc., and present the source as a set of 
relations. 

The link between the logical representation of a source and the Domain 
Conceptual Schema can be specified in two different ways. 

• According to the so-called global- as-view approach, a query over the source re- 
lations is associated to each concept in the Domain Conceptual Schema. Every 
such concept is thus seen as a view over the sources. 

• In the alternative local-as-view approach , one associates with each source relation 
a query that describes its content in terms of the Domain Conceptual Schema. 
In other words, the logical content of a source relation is described in terms of a 
view over the Domain Conceptual Schema. 

In [Levy, 2000], it is argued that the local-as-view approach has several 
advantages, and we will follow this approach in the rest of the chapter. 

To describe the content of the sources through views, one needs a notion of 
query such as the union of conjunctive queries over the Domain Conceptual 
Schema. Specifically, a source relation is described in terms of a query of the 
form 


<?(x) <- conj^-x, yi) V • • • V conj m (x , y m ) 


where: 

• The head g(x) defines the schema of the relation in terms of a name, and the 
number of columns. 

• The body describes the content of the relation in terms of the Domain Conceptual 
Schema. 

In [Calvanese et al, 2001c], conjjx, y,;) is a conjunction of atoms, and x, y \ 
are all the variables appearing in the conjunct (we use x to denote a tuple 
of variables x±, . . . ,x n , for some n). Each atom is of the form E(t), R( t), 
or A(t, t'), where t, t, and t! are variables in x, y t or constants, and E , R , 
and A are respectively entities, relationships, and attributes appearing in 
the Domain Conceptual Schema. 

The semantics of queries is as follows. Given a database that satisfies the 
Domain Conceptual Schema, a query q of arity n is interpreted as the set 
of ra-tuples (di, . . . ,d n ), with each d* an object of the database, such that, 
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when substituting each di for Xi, the formula 

E3y 1 .conj 1 (x, yi) V • • • V 3y m .conj m (x, y m ) 
evaluates to true. 

Analogously to the case of the conceptual level, it is interesting to perform 
several reasoning tasks on the DL representation of the sources, for example 
for inferring redundancies and/or inconsistencies among data stored in differ- 
ent sources. Since queries that include atoms from the Conceptual Schema 
are more expressive, new algorithms are required to answer the following 
problems: 

• Query containment. Given two queries q\ and q 2 (of the same arity n), check 
whether q\ is contained in ( 72 , i.e., check whether the set of tuples denoted by q± 
is contained in the set of tuples denoted by <72 in every database satisfying the 
Conceptual Schema. Papers that contain results relating to this question include 
[Levy and Rousset, 1998; Calvanese et al. , 1998a; Goasdoue and Rousset, 2000]. 

• Query consistency. Check whether a query q over the Conceptual Schema is 
consistent, i.e., check whether there exists a database satisfying the Conceptual 
Schema in which the set of tuples denoted by q is not empty. 

• Query disjointness. Check whether two queries q± and (72 (of the same arity) over 
the Conceptual Schema are disjoint, i.e., check whether the intersection of the 
set of tuples denoted by q\ and the set of tuples denoted by 52 is empty, in every 
database satisfying the Conceptual Schema. 


16.4-2 Query answering 

The ultimate goal of a data integration system is to allow the user to pose 
queries over the global view, and to answer the queries by accessing the 
sources in a transparent way. The mechanism for answering queries differs 
depending on the approach adopted for specifying the sources. The pos- 
sibility of reasoning about queries can provide useful support in both the 
global-as-view and the local-as-view approaches. As in the previous subsec- 
tion, here we focus on the local-as-view approach, that is the one in which 
query answering is most complex. 

In the local-as-view approach, relations at the sources are modeled as 
views over the virtual database represented by the Domain Conceptual 
Schema. Since the database is virtual, in order to answer a query Q for- 
mulated over the Domain Conceptual Schema, we can only use the source 
views. In other words, query processing cannot simply be done by looking at 
a set of relations, as in traditional databases, but requires reasoning on both 
the form of the query, and the content of the source views. This motivates 
the idea that query answering in data integration becomes the problem of 
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view-based query processing. There are two approaches to view-based query 
processing, called query rewriting and query answering , respectively. 

In the former approach, we are given a query Q and a set of view defini- 
tions, and the goal is to reformulate the query into an equivalent expression 
that refers only to the views available, and provides the answer to Q. 

In the latter approach, besides Q and the view definitions, we also take 
into account the extensions of the views, and the goal is to compute the set 
of tuples that are implied by these extensions, i.e. , the set of tuples t such 
that t satisfies Q in all the databases that are consistent with the views. 

Notice the difference between the two approaches. In query rewriting, 
query processing is divided in two steps, where the first re-expresses the 
query in terms of a given query language over the alphabet of the view 
names, and the second step evaluates the rewriting over the view extensions. 
In query answering, we do not pose any limit on query processing, and the 
only goal is to exploit all possible information, including view extensions, to 
compute the answer to the query. 

View-based query processing has been extensively investigated by the 
database community [Levy, 2000]. Only recently has the problem been stud- 
ied for the case where the Domain Conceptual Schema is expressed in De- 
scription Logics. For example, [Baader et al. , 2000] addresses the problem 
of rewriting queries that are concepts in terms of concepts in the con- 
ceptual schema. Query rewriting for more general queries (e.g., ones in- 
volving conjunctions of atoms) has been studied in [Beeri et al., 1997; 
Levy and Rousset, 1998; Goasdoue et al., 2000; Calvanese et al., 2001c], 
in some cases taking into consideration complex constraints expressed in 
Description Logic as part of the Conceptual Schema. One issue that must 
be addressed here is that the original query Q may not be rewritable as an 
expression over the views because of limitations of the language for combin- 
ing views. In this case, one must find heuristic best-effort approximations. 
Another issue is finding a minimum-cost rewriting (e.g., by eliminating un- 
necessary lookups in some of the views). 

Finally, we mention that Goasdoue et al. [2000] describe an implemented 
information integration system, which uses a combination of global-as-view 
and limited local-as-view approaches applied to the ALM DL and non- 
recursive Horn rules. 

Among the pioneering attempts at solving the query answering prob- 
lem is the Information Manifold system [Levy et al., 1996; 1995], which 
has detailed algorithms for query rewriting. In the context of heteroge- 
neous databases, Mena et al. [2000] propose that each source has its own 
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conceptual schema/ontology expressed in a Description Logic, and these 
are interrelated by adding “hyponym” (subsumption) relationships between 
concepts in each. (This is reminiscent of the approach in [Catarci and Lenz- 
erini, 1993].) One of the interesting features of this system is that it takes 
seriously the approximations resulting from the fact that some queries may 
not be expressible in terms of the combined ontologies. Among others, they 
study the notions of “precision” and “accuracy” of recall to quantify this 
approximation. A solution to the query answering approach is presented in 
[Calvanese et al, 2000a], which, among other things, illustrates the relation- 
ship between view-based query answering and ABox reasoning in Description 
Logics. 


16.5 Conclusions 

We have reviewed a number of ways in which Description Logics can be 
useful in the development and utilization of databases. 

Probably the most successful applications are in areas where the concep- 
tual model of the UofD is required. This includes the initial development 
stage, as well as access to heterogeneous data sources. 

Concerning the initial conceptual modeling: First, Description Logics are 
powerful enough to capture the domain semantics represented by various 
entity-relationship data models, as well as other data models introduced 
in the database literature. In fact, with most Description Logics, one can 
represent additional constraints. Second, because Description Logics have a 
clear semantics, the meaning of the DL model is unambiguous and precise. 
Third, not only can information be represented, but it can also be reasoned 
with: one can look for inconsistent class/entity definitions (ones that cannot 
have any individual instances) and more generally, one can check for the 
consistency of the entire model. Both of these are signs to the developer 
that there are modeling errors. Arguably, it is this third aspect, concerning 
reasoning with the model, that is the greatest advantage of DL models. 

DL descriptions can be viewed as necessary and sufficient conditions, and 
hence as queries (or views!) for a database. Description Logics are somewhat 
less successful in this regard (at least in their pure form), because they 
have limited expressive power compared to the standard calculi known from 
relational databases, and because they cannot generate new objects - only 
select subsets of existing objects. 

However, if one accepts a Description Logic as a data model, then DL 
queries can be classified with respect to schema concepts and previous 
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queries, supporting query by refinement and data exploration. The sub- 
sumption relationship can also be used for semantic query optimization. 

Combining Description Logics with Datalog rules, or at least supporting 
conjunctive queries from concepts, is a promising way to obtain a more ex- 
pressive query language. The evaluation of the resulting queries appears to 
be decidable with a wide range of Description Logics if the rules are not 
recursive. The addition of recursion appears to lead to undecidability rela- 
tively quickly. However, full recursion is not an necessity for practical appli- 
cations, such as information integration, so further research in the possible 
combinations of Description Logics and Datalog restrictions is warranted. 

The ability to represent the semantics of a UofD is also the reason why 
Description Logics are useful in situations where information is to be inte- 
grated from various sources, such as heterogeneous or federated databases. It 
is widely agreed that the integration needs to be achieved at the conceptual 
level. The Description Logic can be used to define the ontology of each site, 
and then these ontologies are inter-related; alternatively, a global ontology 
is specified, and then the sites are described as views over it. 



Appendix 

Description Logic Terminology 

Franz Baader 


Abstract 

The purpose of this appendix is to introduce (in a compact manner) the 
syntax and semantics of the most prominent DLs occurring in this handbook. 
More information and explanations as well as some less familiar Description 
Logics can be found in the respective chapters. For DL constructors whose 
semantics cannot be described in a compact manner, we will only introduce 
the syntax and refer the reader to the respective chapter for the semantics. 
Following Chapter 2 on basic Description Logics, we will first introduce 
the basic Description Logic AC, and then describe several of its extensions. 
Thereby, we will also fix the notation employed in this handbook. Finally, 
we will comment on the naming schemes for Description Logics that are 
employed in the literature and in this handbook. 


A.l Notational conventions 

Before starting with the definitions, let us introduce some notational conven- 
tions. The letters A, B will often be used for atomic concepts, and C, D for 
concept descriptions. For roles, we often use the letters R, S, and for func- 
tional roles (features, attributes) the letters f,g. Nonnegative integers (in 
number restrictions) are often denoted by n, m, and individuals by a, b. In all 
cases, we may also use subscripts. This convention is followed when defining 
syntax and semantics and in abstract examples. In concrete examples, the 
following conventions are used: concept names start with an uppercase letter 
followed by lowercase letters (e.g., Human, Male), role names (also functional 
ones) start with a lowercase letter (e.g., hasChild, marriedTo), and individual 
names are all uppercase (e.g., CHARLES, MARY). 
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A. 2 Syntax and semantics of common Description Logics 

In this section, we introduce the standard concept and role constructors as 
well as knowledge bases. For more information see Chapter 2. 


A. 2.1 Concept and role descriptions 

Elementary descriptions are atomic concepts and atomic roles (also called 
concept names and role names). Complex descriptions can be built from 
them inductively with concept constructors and role constructors. Concept 
descriptions in AC are formed according to the following syntax rule: 


A | 

(atomic concept) 

T | 

(universal concept, top concept) 

± 

(bottom concept) 

-iA | 

(atomic negation) 

CnD \ 

(intersection) 

VR.C | 

(value restriction) 

3R.T 

(limited existential quantification) 


Following our convention, A denotes an atomic concept and C, D denote 
concept descriptions. The role R is atomic since AC does not provide role 
constructors. 

An interpretation T consists of a non-empty set A x (the domain of the 
interpretation) and an interpretation function, which assigns to every atomic 
concept A a set A 1 C A x and to every atomic role R a binary relation R 1 C 
A 1 x A x . The interpretation function is extended to concept descriptions 
by the following inductive definitions: 


T 1 

= A 1 

1 1 

= 0 

-n. A 1 

N 

H 

<1 

II 

{CnD) 1 

= c I nD I 

(VR.C) 1 

= {a G A 1 Mb. (a, b) g R 1 -> b e C 1 } 

{BR.Tf 

= {a G A 1 3b. (a,b) £ R 1 }. 


There are several possibilities for extending AC in order to obtain a more 
expressive Description Logic. The three most prominent are adding addi- 
tional concept constructors, adding role constructors, and formulating re- 
strictions on role interpretations. Below, we start with the third possibility, 
since we need to refer to restrictions on roles when defining certain con- 
cept constructors. For these extensions, we also introduce a naming scheme. 
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Basically, each extension is assigned a letter or symbol. For concept con- 
structors, the letters/symbols are written after the starting AC , for role 
constructors, we write the letters/symbols as superscripts, and for restric- 
tions on the interpretation of roles as subscripts. As an example, the De- 
scription Logic ACC Q^ 1 extends AC with the concept constructors negation 
(C) and qualified number restrictions ( Q) , the role constructor inverse ( -1 ), 
and the restriction that some roles are transitive (r c ). 

Restrictions on role interpretations 

These restrictions force the interpretations of roles to satisfy certain proper- 
ties, such as functionality and transitivity. We consider these two prominent 
examples in more detail. Others would be symmetry or connections between 
different roles. 1 

1 . Functional roles. Here one considers a subset Np of the set of role names N R - 
its elements are called features. An interpretation must map features / to func- 
tional binary relations f x C A x x A x , i.e., relations satisfying Va, b, c.f x (a , b) A 
/ x (a, c) — > b = c. Sometimes functional relations are viewed as partial func- 
tions, and thus one writes f x (a ) = b rather than / x (a, b). AC extended with 
features is denoted by ACf. 

2 . Transitive roles. Here one considers a subset N R + of N R . Role names R € N R + 
are called transitive roles. An interpretation must map transitive roles R € N R + 
to transitive binary relations R x C A x x A x . AC extended with transitive roles 
is denoted by AC R + . 


Concept constructors 

Concept constructors take concept and/or role descriptions and transform 
them into more complex concept descriptions. Table A.l shows the syntax 
and semantics of common concept constructors. In order to have them all 
in one place, we also repeat the ones from AC, minus atomic negation and 
limited existential quantification since they are special cases of negation and 
existential quantification. 

Some explanatory remarks are in order. The symbols 14,1(2 in the agree- 
ment constructor stand for chains of functional roles, i.e., 14 = /1 . . . f m and 
U2 = gi ■ ■ - g n where n, to > 0 and the fi,gj are features. The semantics of 
such a chain is given by the composition of the partial functions interpreting 
its components, i.e., trf(a) = //, ( . . . (a) . . .). Nominals (or individuals) in 

concept expressions are interpreted as singleton sets, consisting of one el- 
ement of the domain. We assume that names for individuals come from a 

1 One could also count role hierarchies as imposing such restrictions. Here we will, however, treat 
role hierarchies in the context of knowledge bases. 
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Table A.l. Some Description Logic concept constructors. 


Name 

Syntax 


Semantics 

Symbol 

Top 

T 

A 1 


AC 

Bottom 

T 

0 


AC 

Intersection 

CnD 

c x nD x 


AC 

Union 

RUS 

C X UD X 


U 

Negation 

'C 

A I \C I 


C 

Value 





restriction 

MR.C 

H 

<1 

W 

Mb. (a, b) G R 1 — > b G C 1 } 

AC 

Existential 





quant . 

3R.C 

H 

<1 

3b. (a, b) £ R x Abe C 1 } 

£ 

Unqualified 

^ nR 

{a £ A 1 

\{b £ A 1 | (a,b) £ R 1 } | > n} 


number 

^ nR 

{a G A 1 

\{b £ A 1 | (a, b) £ R x }\ < n} 

AT 

restriction 

= nR 

{a G A 1 

\{b £ A x (a,b) £ R x }\ = n} 


Qualified 

> nR.C 

H 

<1 

u; 

\{b £ A x | (a, b) £ R 1 Abe C x }\ > n} 


number 

< n.R.C 

H 

<1 

O' 

{5 £ A x \ {a,b) £ R x Abe C x }\ < n} 

Q 

restriction 

= n R.C 

{a G A 1 

(6 G A x | (a, b) G R x Abe C x }\ = n} 


Role- value- 

RC S 

{a G A 1 

Mb. (a, b) £ R x — > (a, b) £ S 1 } 


map 

R = S 

{a G A 1 

Mb.(a, b) £ R x <-> (a, b) £ S x } 


Agreement and iq = u 2 

{a G A 1 

3b £ A 1 . u x (a) = b = u x (a)} 

T 

disagreement 

ui yf u 2 

{a G A 1 

3bi,b 2 £ A 1 . u x (a ) = bi ^ b 2 = wf(a)} 


Nominal 

I 

I 1 C A 1 with \I X \ = 1 

O 


name space disjoint from the set of concept and role names. Since role- value- 
maps cause undecidability and thus are no longer used in DL systems, there 
is no special symbol for them in the last column of Table A.l. 

Many DL systems employ a LiSP-like concrete syntax. Table A. 2 intro- 
duces this syntax and gives a translation into the abstract syntax introduced 
in Table A.l. 


Role constructors 

Role constructors take role and/or concept descriptions and transform them 
into more complex role descriptions. Table A. 3 shows the syntax and seman- 
tics of common role constructors. 

The symbol o denotes the usual composition of binary relations, i.e., 


R 1 o S 1 = {(a, c) | 3b. (a, b) e R 1 A (6, c) e S 1 }. 
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Table A. 2. Concrete syntax of concept constructors. 


Name 

Concrete syntax 

Abstract syntax 

Top 

TOP 

T 

Bottom 

BOTTOM 

1 

Intersection 

(and Ci . . . C„) 

Ci n • • • n c n 

Union 

(or C 1 ...C„) 

C 1 U---UC n 

Negation 

(not C) 

'C 

Value restriction 

(all R C) 

MR.C 

Limited existential quantification 

(some R) 

3R.T 

Existential quantification 

(some R C) 

3R.C 

At-least number restriction 

(at-least n R) 

^ nR 

At-most number restriction 

(at-most n R) 

^ nR 

Exact number restriction 

(exactly n R) 

— nR 

Qualified at-least restriction 

(at-least n R C) 

^ nR.C 

Qualified at-most restriction 

(at-most n R C) 

< nR.C 

Qualified exact restriction 

(exactly n R C) 

= nR.C 

Same-as, agreement 

(same-as ui U2) 

U\ = u 2 

Role- value-map 

(subset Ri R2) 

R\ c R 2 

Role fillers 

(fillers R Q . . . I„) 

3R.ii n • • • n 3 R.i n 

One-of 

(one-of Ii . . . I„) 

h u • • • u l n 


Table A. 3. Some Description Logic role constructors. 


Name 

Syntax 

Semantics 

Symbol 

Universal role 

U 

H 

<1 

X 

H 

<1 

U 

Intersection 

RnS 

r 1 ns 1 

n 

Union 

RuS 

R 1 U S 1 

u 

Complement 

^R 

A 1 x A 1 \ R 1 

— 1 

Inverse 

R~ 

{(6, a) G A 1 x A 1 | (a, b) G R 1 } 

-1 

Composition 

RoS 

R t oS x 

O 

Transitive closure 

R+ 

u„>i (R T r 

+ 

Reflexive-transitive closure 

R* 

U„> 0 {R T ) n 

* 

Role restriction 

R\c 

R 1 n (A x x c 1 ) 

r 

Identity 

id(C) 

{{d,d) 1 dec 1 } 

id 
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Table A. 4. Concrete syntax of role constructors. 


Name 

Concrete syntax 

Abstract syntax 

Universal role 

top 

U 


Intersection 

( and Ri • • • R n ) 

Ri n • • 

■ n R n 

Union 

(or Ri • • • R„) 

Ri U • • 

•u R n 

Complement 

(not R) 

— li? 


Inverse 

(inverse R) 

R~ 


Composition 

(compose R x ••• R„) 

R\ o ■ ■ 

• o R n 

Transitive closure 

(transitive-closure R) 

R+ 


Reflexive-transitive closure (reflexive-transitive-closure 

R) R* 


Role restriction 

(restrict R C) 

R\c 


Identity 

(identity C) 

id(C) 



Iterated composition is denoted in the form ( R x ) n . To be more precise, 

(. R J )° = {{d, d) | d G A 1 } and {R x ) n+1 = (i? J ) n o R 1 . 

Transitive and reflexive-transitive closure are the only constructors among 
the ones introduced so far that cannot be expressed in first-order predicate 
logic. 

The LiSP-like concrete syntax for role constructors can be found in Ta- 
ble A.4. 


A. 2. 2 Knowledge bases 

A DL knowledge base usually consists of a set of terminological axioms 
(often called a TBox) and a set of assertional axioms or assertions (often 
called an ABox) . The syntax and semantics of these axioms can be found in 
Table A. 5. An interpretation X is called a model of an axiom if it satisfies 
the statement in the last column of the table. 

An equality whose left-hand side is an atomic concept (role) is called a 
concept (role) definition. A finite set of definitions is called a terminology 
or TBox if the definitions are unambiguous, i.e., no atomic concept occurs 
more than once as left-hand side. Axioms of the form CAD for a complex 
description C are often called general inclusion axioms. A set of axioms of 
the form RAS where both R and S are atomic is called a role hierarchy. 
Such a hierarchy obviously imposes restrictions on the interpretation of 
roles. Thus, the fact that the knowledge base may contain a role hierarchy 
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Table A. 5. Terminological and assertional axioms. 


Name 

Syntax 

Semantics 

Concept inclusion 

CCD 

C 1 C D 1 

Role inclusion 

RCS 

R 1 C S 1 

Concept equality 

C = D 

H 

Cl 

II 

Role equality 

R = S 

R 1 = S T 

Concept assertion 

C(a) 

a 1 G C x 

Role assertion 

R(a, b) 

(< a ; x , b 1 ) G R x 


Table A. 6. Concrete syntax of axioms. 

Name 

Concrete syntax 

Abstract syntax 

Concept definition 

(def ine-concept A C) 

A = C 

Primitive concept introduction (def ine-primitive-concept A 

C) ACC 

General inclusion axiom 

(implies C D) 

CCD 

Role definition 

(define-role R S) 

R = S 

Primitive role introduction 

(def ine-primitive-role R S) 

RCS 

Concept assertion 

(instance a C) 

C (° ) 

Role assertion 

(related a b R) 

R(a , b) 


is sometimes indicated by appending a subscript Ti to the name of the 
Description Logic (see “Restrictions on role interpretations” above). 

The concrete LiSP-like syntax in Table A. 6 distinguishes between termino- 
logical axioms with atomic concepts as left-hand sides and the more general 
ones. Following the convention mentioned at the beginning of this appendix, 
A denotes an atomic concept, and R denotes an atomic role. 

A. 3 Additional constructors 

Here we mention some of the additional constructors that occur somewhere 
in the handbook. For most of them, the semantics cannot be described in a 
compact manner, and thus we refer to the respective chapter for details. 


A. 3.1 Concept and role constructors 

Many additional constructors are introduced in Chapter 6. In Description 
Logics with concrete domains one can use concrete predicates to constrain 
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fillers of feature chains, similarly to the use of the equality predicate in 
feature agreements. For example, if hasAge is a feature and >is the unary 
concrete predicate consisting of all nonnegative integers greater than or equal 
to 18, then 3hasAge.>is describes the individuals whose age is greater than 
or equal to 18. In general, an existential predicate restriction is of the form 

3(ui, . . .,u n ).P, 

where P is an n-ary predicate of the underlying concrete domain and 
ui, . . . ,u n are feature chains. One can also use concrete domain predicates 
to define new roles. For example, 3(hasAge)(hasAge).> consists of all pairs 
of individuals having an age such that the first individual is older than the 
second one. The general form of such a complex role is 

3(m, . . .,u n )(vi, . . .,v m ).P, 

where P is an (n + m)- ary predicate of the underlying concrete domain and 
u\, . . . , u n , and vi,...,v m are feature chains. 

In modal extensions of Description Logics, one can apply modal operators 
to concepts and/or roles, i.e., if □ is such a modal operator, C is a concept, 
and R is a role, then 

DC and OR 

are a concept and a role, respectively. Similarly, one can also use diamond 
operators O to obtain new concepts and roles. A special such modal operator 
is the epistemic operator K, which can be used to talk about things that 
are known to the knowledge base. 

Chapter 5 introduces several additional constructors. Least and greatest 
fixpoint semantics for cyclic terminologies (see Chapter 2) can be generalized 
by introducing fixpoint constructors directly into the description language. 
Let A be a concept name and C a concept description containing the name 
X. Then 


p,X.C and vX.C 

are new concept descriptions respectively obtained by applying the least 
and the greatest fixpoint constructor to C. To ensure that the least and the 
greatest fixpoint exist, one must restrict C to be syntactically monotonic, 
i.e., every occurrence of X in C must be in the scope of an even num- 
ber of complement operators. For example, given an interpretation Mari' 1 of 
Man and hasChild 1 of hasChild, the concept z/Momo.(Man n VhasChild.Momo) 
looks for the greatest interpretation Momo 1 of Momo such that Momo 1 = 
(Man n VhasChild.Momo)- 1 . It is easy to see that this is the set of all men 
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having only male offspring (see Chapter 2 for the corresponding example 
with a cyclic TBox). 

Chapter 5 also considers the Description Logic VCTZ, in which the re- 
striction to at most binary predicates is no longer enforced. If R is an n-ary 
predicate, i £ {1, . . . , n}, and A: is a nonnegative integer, then 

3[$i\ R 

denotes the concept collecting those individuals that occur as the ith com- 
ponent in some tuple of R, and 

^ k [$z] R 

denotes the concept collecting those individuals d for which the predicate R 
contains at most k tuples whose ith component is d. Conversely, if C is a 
concept, n a nonnegative integer, and i £ {1, . . . , n}, then 

($i/n : C) 

denotes the n-ary predicate consisting of the tuples whose ith component 
belongs to C . The Description Logic VC1Z also allows Boolean operators on 
both concepts and predicates. 2 


A. 3. 2 Axioms 

In addition to the semantics for terminological axioms introduced above, 
Chapter 2 also considers fixpoint semantics for cyclic TBoxes. 

Chapter 6 introduces several ways of extending the terminological and the 
assertional component of a DL system. In Description Logics with concrete 
domains one can use concrete predicates also in the ABox in assertions of 
the form 


P(x i, • • ■ ,x n ), 

where P is an n-ary predicate of the underlying concrete domain and 
x\, ... ,x n are names for concrete individuals. 

In some modal extensions of Description Logics, one can apply modal and 
Boolean operators also to terminological and assertional axioms: if ip, are 
axioms, then so are 


<p/\il>, -up, □</?. 

2 Note, however, that negation on predicates has a non-standard semantics (see Chapter 5 for 
details). 
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In probabilistic extensions of Description Logics, one can use probabilistic 
terminological axioms of the form 

P(C\D) = p, 

which state that the conditional probability for an object known to be in D 
to belong to C is p. 

The integration of Reiter’s default logic into Description Logics yields 
terminological defaults of the form 

C(x) : D(x) 

E(x) ’ 

where C,D,E are concept descriptions (viewed as first-order formulae with 
one free variable x). Intuitively, such a default rule can be applied to an 
ABox individual a, i.e., E(a) is added to the current set of beliefs, if its 
prerequisite C(a) is already believed for this individual and its justification 
D{a) is consistent with the set of beliefs. 

Rules of the form 


C^E 

(as introduced in Chapter 2) can be seen as a special case of terminological 
defaults where the justification is empty. Their intuitive meaning is: “if an 
individual is known to be an instance of C, then add the information that 
it is also an instance of E.” 


A. 4 A note on the naming scheme for Description Logics 

In Section A. 2 we have introduced a naming scheme for Description Logics, 
which extends the naming scheme for the AC-family introduced in Chap- 
ter 2 by writing letters/symbols for role constructors as superscripts, and for 
restrictions on the interpretation of roles as subscripts. The reason was that 
this yields a consistent naming scheme, which distinguishes typographically 
between the three different possibilities for extending the expressive power 
of AC. 

In the literature, and also in this handbook, other naming schemes are 
employed as well. One reason for this, in addition to the fact that such 
schemes have evolved over time, is that it is very hard to pronounce a name 
like ACCQf j|. We will here point out the most prominent such naming 
schemes. 

The historically first scheme is the one for the AT-family introduced in 
Chapter 2, and extended in this appendix. However, in the literature the 
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typographical distinction between role constructors, concept constructors, 
and restrictions on the interpretation of roles is usually not made. For ex- 
ample, many papers use T to denote inverse of roles, 1Z to denote intersection 
of roles, and TL to denote role hierarchies. Thus, ACC1ZT denotes the exten- 
sion of ACC by intersection and inverse of roles, and ACCTL denotes the 
extension of ACC by role hierarchies. In some cases, the letter X", which we 
have employed to express the presence of feature agreements and disagree- 
ments, is used with a different meaning. Its presence states that number 
restrictions of the form ^ 1 R can be used to express functionality of roles. 3 
The subscript “trans” (or “reg”) is often employed to express the presence 
of union, composition, and transitive closure of roles (sometimes also includ- 
ing the identity role). The Greek letter p, in front of a language name, as in 
IjACC. usually indicates the extension of this Description Logic by fixpoint 
operators. 

All members of the AX-family include AC as a sublanguage. In some 
cases one does not want all the constructors of AC to be present in the 
language. The Description Logic XX~ is obtained from AC by disallowing 
atomic negation, and XXo is obtained from XX - by, additionally, disallowing 
limited existential quantification. If these languages are extended by other 
constructors, one can indicate this in a way analogous to extensions of AC. 
For example, X C~U denotes the extension of XX~ by union of concepts. 

All the Description Logics mentioned so far contain the concept construc- 
tors intersection and value restriction as a common core. Description Log- 
ics that allow intersection of concepts and existential quantification (but 
not value restriction) are collected in the XX-family. The only construc- 
tors available in XX are intersection of concepts and existential quantifica- 
tion. Extensions of XX are again obtained by adding appropriate letters/ 
symbols. 

In order to avoid very long names for expressive Description Logics, the 
abbreviation S has been introduced for ACC nr ^ i.e. , the Description Logic 
that extends „4XC by transitive roles. Prominent members of the X-family 
are STM (which extends ACCr+ with number restrictions and inverse roles), 
STLTLF (which extends ACC nr with role hierarchies, inverse roles, and num- 
ber restrictions of the form ^1 R ), and STCTQ (which extends ACC nr with 
role hierarchies, inverse roles, and qualified number restrictions). Actually, 
the Description Logics STM, STL. IX, and STLIQ are somewhat less expres- 
sive than indicated by their name since the use of roles in number restrictions 

3 Unlike the restriction of R to be functional, which we express with a subscript /, this allows 
local functionality statements, i.e., R is functional at a certain place, but may be non-functional 
at other places. 
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is restricted: roles that have a transitive subrole must not occur in number 
restrictions. 

The Description Logic VC1Z mentioned in the previous section also gives 
rise to a family of Description Logics, with members like VClZ reg , which 
extends VC1Z with union, composition, and transitive closure of binary 
relations obtained as projections of n-ary predicates onto two of their 
components. 
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A, see atomic concept 
C, see concept 

C i n C 2 , see intersection of concepts 

Ci => C 2 , see trigger rule 

C\ = C 2 , see equality axiom 

Ci C 2 , see equivalent w.r.t. TBox 

Ci C C 2 , see inclusion axiom 

Ci C.'T C 2 , -see subsumption 

Ci U C 2 , see union of concepts 

C(a), see concept assertion 

P, see atomic role 

P, see atomic relation 

P , see role 

R, see relation of arbitrary arity 

R * , see reflexive-transitive closure of roles 

P+, see transitive closure of roles 

R ~ , see inverse role 

Pi = P 2 , see equality role- value-map 

Pi Q P 2 > see containment role- value- map 

Pi n P 2 , see intersection of roles 

Ri n R 2 , see intersection of relations 

Pi 0 P 2 ? see composition of roles 

Pi = P 2 , see equality axiom 

Pi C. P 2 , see inclusion axiom 

Pi U P 2 , see union of roles 

Ri l_l R 2 , see union of relations 

R\ c , see role restriction 

P(ai,a 2 ), see role assertion 

T \= Ci = C 2 , see equivalent w.r.t. TBox 

T \= Ci C C 2 , see subsumption 

( %i/n:C ), see selection on relations 

^fe[$i]R, $Cfc[$ 2 ]R, see 

number restriction on relations 
^nP, see at-least restriction 
^nP.C, ^nP.C, see 

qualified number restriction 
^nP, see at-most restriction 
_L, see bottom concept 
A x , see interpretation domain 
/, see feature chain 
fi = f 2 , see agreement 
fi 7 ^/ 2 , see disagreement 
VP.C, see value restriction 


id(R), see role identity 

fiX.C, see least fixpoint operator 

—1 A, see atomic negation 

-i C, see complement of concepts 

-i P, see complement of roles 

->R, see difference of relations 

vX.C , see greatest fixpoint operator 

3P.C, see full existential quantification 

3P.T, see limited existential quantification 

3[$i]R, see 

existential quantification on relations 
T, see top concept 
2+2-sat, 142 

2ATA, see two-way alternating tree 
automaton 

ABox, 14, 50, 65, 213, 399 
absorption, 350 
acceptable solution, 228 
accessibility relation, 42, 167, 247 
between intervals, 260 
equivalence, 250 
reflexive and transitive, 167 
transitive, 172 

transitive, serial, and euclidean, 250 
universal, 253 
access to knowledge bases 
advanced application, 290 
basic, 287 

concurrent access, 293 
data structure, 292 
notification, 292 
remote access, 293 
action-guarded fragment, 174 
acyclic TBox, 57, 59, 95, 133 
admissible concrete domain, 242 
advanced application access, 290 
AE-model, 257 
agent, 167 

agreement, 103, 235, 305 
AC , 52, 54 

complexity with axioms, 132 
complexity with restricted axioms, 133 
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ACC, 54 

complexity, 125 
ACCB, 256 “ 

ACC(D), 240 
ACCTlreg , 202 , 212 
complexity, 209 
ACClOreg, 216 
ACCTreg, 195, 200 
complexity, 201 
ACCK., 252 
ACCM, 54, 85, 93 
complexity, 126 

of ABox consistency, 93 
of concept satisfiability, 92 
w.r.t. general inclusion axioms, 94 
ACC Q2, 232 
ACCQXreg, 209 
complexity, 213 
ACCQOreg, 216 
ACCreg, 168, 195, 200 
complexity, 201 
ACCt rans 

complexity, 127 
ACS, 54 

complexity, 117 
ACSM, 54 

complexity, 126 
AlFresco, 491 
AC-log, 514 
ACM, 54, 85, 96 

complexity with restricted axioms, 133 
ACMX, 227 
ACAf( n) 
complexity, 126 

AC( n) 

complexity, 121 

Alternating Polynomial Space, 128 
ACU, 54 
ACUNX, 229 
ACU{ n) 
complexity, 126 
AC family, 53, 54 
AND-OR graph accessibility, 129 
approximation, 273, 276, 281 
arity, 505 
assertion, 50 

concept, see concept assertion 
inequality, see inequality assertion 
membership, see membership assertion 
probabilistic, see probabilistic assertion 
role, see role assertion 
at-least restriction, 53 
at-most restriction, 53 
AT&T, 419, 421, 429 
atomic 

concept, 7, 51, 52 
negation, 52 
relation, 221 
role, 7, 51 
symbol, 51 
ATP, 427 


attribute, 103, 175, 181, 380 
in the ER model, 175, 505 
in XML, see XML attribute 
partial, see partial attribute 
total, see total attribute 
autoepistemic logic, 256 
axiom schema instantiation, 209 

Back, 113, 319, 323 
backjumping, 362 
backtracking search, 362 
base interpretation, 57 
base symbol, 56, 98 
basic access to knowledge bases, 287 
basic role, 202, 209, 234, 235 
Bayesian network, 266 
BCP, see Boolean constraint propagation 
belief operator, 246 
Beth’s Definability Theorem, 59 
blocking, 94, 341, 343, 362, 369 
Boolean constraint propagation, 361, 366 
Boolean operators on roles, 99, 100, 165, 210, 
224, 508 

bottom-up approach, 274 
bottom concept, 52 
bottom search, 354, 355 
branching degree, 203 
breadth first search, 354 
Biichi 

acceptance condition, 204 
automaton, 97 

C, see complement of concepts 
C 2 , 82, 100, 101, 164, 232 
caching, 342, 344, 356, 358, 369 
Candide, 513 

canonical interpretation, 90 

cardinality constraint, 176, 178, 505 

Carin, 514 

CarsDirect, 418 

CASE tool, 175, 180, 186 

CCNF, see concept-centered normal form 

CG, 155 

characteristic concept, 187 
Chimaera, 295 
C k , 164 

clash, 87, 89, 92 
class, 175 

declaration, 181 
class-based formalism, 40 
Classic, 115, 239, 286, 314, 337, 424, 431, 
460, 513 

classification, 15, 308, 333, 347, 353 

classifier, 309 

CLIN-S, 338 

Clinergy, 446, 453 

closed- world semantics, 65, 75 

code-based management system, 410 

CodeBase, 408 

Common Domain Assumption, 78 
common knowledge, 250 
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complement 

of concepts, 8, 53 
of roles, 99 , 162, 338 
complete, 90 
ABox, 87, 89 
completeness 

of algorithm, 81, 85, 90 
of characterization, 83 
composition of roles, 99 , 195, 234 
compound 
concept, 230 
role, 230 

computational complexity, 11 
concept, 50, 150 
assertion, 16, 65 
atomic, see atomic concept 
complement, see complement of concepts 
conjunction, see intersection of concepts 
constructor, 51 
description, 52 

disjointness, see disjointness of concepts 
disjunction, see union of concepts 
intersection, see intersection of concepts 
language, 3 

necessary condition, 385 
negation, see complement of concepts 
partial condition, 385 
pattern, 277 

reified, see reified concept 
type, 157 

union, see union of concepts 
variable, 277 

concept-centered normal form, 97 , 274 
conceptual model, 378 
conceptual graph, 40, 155 

simple, see simple conceptual graph 
conceptual modeling, 503 
concrete domain, 37, 240 
concurrent access to knowledge bases, 293 
configuration, 26, 286, 296, 417 
conformance, 187 
conjunction 

of concepts, see intersection of concepts 
of roles, see intersection of roles 
connected model property, 200 
coNP-hardness 
in PC, 110 
consistency, 72 , 93 
of knowledge base, see 

knowledge base consistency 
consistent 

ABox, see consistency 
set of ABoxes, 87 

constant domain assumption, 248, 257 
constraint satisfiability, 363 
containment role- value- map, 102 , 134 
converse- dpdl, 198, 202 
converse- pdl, 197, 209 
counting quantifier, 82, 163 
Crack, 322 
Cyc, 449 


cycle, see cyclic TBox 
cyclic TBox, 57 , 96, 133 

DAML, 438, 465 
DAML+OIL, 438 
DAML+OIL, 465 
DAML-ONT, 465 

DARPA Agent Markup Language, see DAML 
data 

complexity, 140 
integration, 517 
mining, 29 
model 

object-oriented, see 

object-oriented data model 
semantic, see semantic data model 
semistructured, see 

semistructured data model 
structure 
access, 292 

inductive and coinduct ive, 217 
database, 31, 500 
Datalog, 503, 514, 524 
default logic, 34, 258 
default rule, 35, 268 
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concept, 57 , 308 
subconcept, 388 
definition, 14, 56 , 334 
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necessary and sufficient, see definition 
non-primitive, see definition 
order, 355 

primitive, see specialization 
definitorial, 57 , 58, 59 
Dell, 418 

dependency set, 363 
description language, 52 
Description Logic courses, 302 
descriptive semantics, 59, 96, 163, 196 
deterministic-converse-PDL, see converse- dpdl 
deterministic program, 171 
difference of relations, 221 , 222, 508 
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disagreement, 103 
discovery software, 403, 405 
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of concepts, 68 
of subconcepts, 387 
disjoint union model property, 211 
disjunction 

of concepts, see union of concepts 
of roles, see union of roles 
DLMS, 492 
Dlp, 325, 330 
VCR, 174, 221 , 507 
complexity, 223 
VCR^, 174, 224 
VCRreg, 224 
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DOM, see Document Object Model 
domain, 175, 463 

of interpretation, see interpretation domain 
domino problem, see tiling problem 
DPDL, 198 
DPLL, 360 
DTD, see XML DTD 
DWQ, 512 

dynamic backtracking, 372 
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property, see property edge 
subclass/superclass, see 

subclass / superclass edge 
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content, see XML element content 
type, see XML element type 
enhanced traversal, 355 
entity, 175, 505 
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model, 175, 501, 504 
schema, 175, 506 
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interpretation, 79 
logic, 35, 247 
model, 79, 254 
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operator, 77, 251, 400 
query, 254 
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equality axiom, 56 
equality role- value-map, 102, 133 
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error handling, 297 
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on relations, 221, 508 
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expansion, 58, 71, 72, 334 
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feature, 103, 305, 324, 380 
chain, 103, 305 
fills, 66, 216 
FindUR, 287, 301, 460 
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finite model 

property, 83, 100, 179 
reasoning, 179, 227 
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first-order predicate logic, 54 
Fischer-Ladner closure, 198, 204 
fixpoint 

construct, see fixpoint operator 
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model, 61 

mutual, see mutual fixpoint 
operator, 96, 147, 168, 187, 217 
semantics, 59, 169, 186, 221 

greatest, see greatest fixpoint semantics 
least, see least fixpoint semantics 
TC 

complexity, 109, 126 
TC~ (o, C), 134 
TCo, 52, 83, 95, 96, 274 
TC_ l, 84 
TC~E 

complexity, 121 

TC~ CM 

complexity, 112 
Flex, 319 
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complexity, 123 
TC ~ , 52 
flow of time, 259 
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complexity, 122 

,-) 

complexity, 121 
Ford, 419 
frame, 5, 152, 463 

system, 2, 41, 152, 305 
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approach to KR, 13 
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relation, 172 
restriction, 202 
role, 103 
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Description Logic, 36, 266 
logic, 266 
set, 266 

Galen, 331, 350, 436, 438, 444 
Games, 445 

generalized terminology, 64 

general inclusion, see inclusion axiom 

general inclusion axiom, see inclusion axiom 93 

Generic Frame Protocol, 287 

gfp, see greatest fixpoint 

global-as-view, 520 
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graded modality, 171. 209 
Grail, 337, 442, 447, 449-451, 453 
graph-based approach, 272 
greatest fixpoint, 61, 219 

operator, see fixpoint operator 
semantics, 61, 97, 221, 276 
guarded fragment, 173, 224 

7i, see role hierarchy 
Harp, 363 

hierarchy of roles, see role hierarchy 
hook, 292 
HTML, 458 
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logic, 170 
system, 20 

X, see inverse role 
ICD, 440 
i«COM, 180 
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Imacs, 18, 32 
incidental property, 384 
inclusion, see inclusion axiom 
inclusion axiom, 15, 56, 334, 452 
in terminology, 64 
individual, 150, 381 
identifier, 381, 383 
marker, 157 
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value, 382 

inductive learning, 37 
inequality assertion, 87 
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Information Manifold, 522 
inheritance, 150, 175, 181 
by default, 151 
network, 5 

instance checking, 16, 73, 140 
complexity, 143 

integrity constraint, 35, 255, 258 
intensional knowledge, 13 
inter-schema relationship, 519 
internalization, 128, 163, 166, 199, 201, 225, 
335, 336, 344 
interpretation, 52, 66 

canonical, see canonical interpretation 
domain, 52 
function, 52 
intersection 
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of concepts, 7, 8, 52 
of relations, 221, 508 
of roles, 9, 99, 160, 162, 234 
interval-based temporal model, see 
temporal model 

inverse role, 41, 99, 160, 162, 165, 172, 195, 
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IS-A, 5, 30, 392, 448, 506 

between relationships, 179, 506 
in the ER model, 176, 505 
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KADS, 439 
Kandor, 312 
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key, 176, 391 
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Kl-One, 19, 133, 151, 306, 337 

K m , 166 
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base, 50 

consistency, 16 
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extensional, see extensional knowledge 
intensional, see intensional knowledge 
Kreme, 323 
K-Rep, 239, 323, 442 
Kripke 

frame, 172, 249, 253 
structure, 167, 172, 197 
Kris, 321, 330, 347 
Krss, 27, 287, 438, 441 
Krypton, 310, 337 
Ksat, 338 

£ 2 , 82, 162, 164 
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language-based approach, 272 
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lcs, see least common subsumer 

least common subsumer, 37, 273, 275 

least fixpoint, 61, 219 

operator, see fixpoint operator 
semantics, 61, 187, 221 
legal database state, 183, 507 
less expressive, 162 
level, 89 
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Lilog, 491 

limited existential quantification, 52 
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w.r.t. finite models, 232 
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logic of knowledge and belief, 34 
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Mai a, 491 
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medSYNDiCATE, 444 

membership assertion, 16, 65, 213 

Meson, 239 

meta-role, 386 

MKNF, 256 

modalized Description Logic, 246 
modal logic, 34, 42, 82, 166, 246 
model 

epistemic, see epistemic model 
fixpoint, see fixpoint model 
of ABox, 66 
of concept, 68 

of terminological axioms, 56 
modeling methodology, 399 
model checking, 75 
MOMS heuristic, 366 
monotone terminology, 62 
most specific concept, 37, 74, 140, 274, 276 
msc, see most specific concept 
/z-calculus, 96, 166, 168, 217 
fiACCQ , 187 
li ACC Ql, 217 
complexity, 220 
multi-modal logic, 167 
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name symbol, 56 

natural language processing, 30, 487 
DRT/DRS, 497 
lexical choice, 498 
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logical form, 487, 488, 492 
semantic interpretation, 487, 488 
necessary and sufficient definition, see 
definition 

necessary definition, see specialization 
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negation normal form, 86, 205, 241, 340, 349 
negative introspection, 248, 252 
NeoClassic, 292 
network-based structure, 2, 5 
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nominal, 66, 170, 216 
non-determinism, 344 
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logic, 33, 256, 268 
non-primitive, 346 

non-primitive definition, see definition 
non-standard inference, 270 
normalization, 348 
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NP-hardness 
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number restriction, 9, 53, 101, 163, 171, 
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on relations, 221, 508 

qualified, see qualified number restriction 
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O, see nominal 
object-oriented 

data model, 41, 181 
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DBMS, 512 
schema, 181 

obvious (non-) subsumption, 355 

OIL, 431, 438, 461, 463 

OilEd, 466 

OKBC, 287, 438 

oldest-first heuristic, 367 

one-of, 66, 141, 160, 162, 165, 170, 216 

OntoBuilder, 294, 431 

Ontolingua, 294 

ontology, 30, 377, 487, 490, 495, 498 
open- world assumption, see 
open-world semantics 
open- world semantics, 8, 65, 75, 400 
Open Knowledge Base Connectivity, see 
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OWL, 459 

part-whole, 25, 395, 448 

partial attribute, 380 

path tracing, 413 
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PCDATA, see XML PCDATA 

PDL, see Propositional Dynamic Logic 

pdl, 82, 127, 168, 197 

Pellet, 463 

Penni, 311 

Pen&Pad, 438, 446, 453 
planning, 29 

point-based temporal model, see 
temporal model 
positive introspection, 248, 252 
possible worlds semantics, 247, 252 
Post correspondence problem, 139 
Pracma, 491 
predicate logic, 161 
preferential Description Logic, 497 
primitive 
concept, 57 

definition, see specialization 
role, 57 
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probabilistic 
assertion, 265 
Description Logic, 36, 263 
terminological axiom, 263 
procedural 
extension, 77 
rule, see trigger rule 
Prodigy, 445 
program, 168, 197 

deterministic, see deterministic program 
projection, 159 
property edge, 150 

Propositional Dynamic Logic, 82, 127, 168, 

197 

complexity, 200 

converse- dpdl, see converse- dpdl 
converse- pdl, see converse- pdl 
correspondence with Description Logics, 200 
dpdl, see DPDL 
Fischer-Ladner closure, see 
Fischer-Ladner closure 
internalization of axioms, 199 
logical implication, 199 
pdl, see pdl 
program, see program 
satisfiability, 199 

with graded modality, see graded modality 
propositional /^-calculus, see /x-calculus 
propositional reasoning, 342 
Prose, 18, 422, 429, 431, 433 
Protege, 438, 439, 445, 466 
pruning, 298 
P SPACE-hardness 

in AC with cyclic axioms, 133 
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Q, see qualified number restriction 
qualified number restriction, 101, 163, 171, 
209, 305, 313, 324 
Quantified Boolean Formulae, 125 
query 

answering using views, 516, 522 
complexity, 140 
consistency, 513, 521 
containment, 521 
epistemic, see epistemic query 
language, 502, 513 
optimization, 515 
rewriting, 281, 522 
Questar, 429, 433 

Racer, 326, 330, 372, 463 
range, 463 
RDF, 461 
RDFS, 463 

realization, 16, 74, 140, 309 
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regular expression over roles, 127, 168, 195, 
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reification, 177, 180, 210, 391 
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reified concept, 386 
reified counterpart, 210, 223 
relation 

difference, see difference of relations 
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of arbitrary arity, 173, 177, 180, 221, 507 
selection, see selection on relations 
type, 157 

union, see union of relations 
well-typed, see well-typed relation 
relational data model, 501 
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in the ER model, 175, 505 
reified, see reification 
relationship role, 386, 392 
remote access to knowledge bases, 293 
retrieval, 16, 74 
rewriting 

concepts, 39, 281 
queries, see query rewriting 
rewriting system, 135 
rigid term assumption, 78 
role, 50 

agreement, see agreement 
assertion, 16, 65 
atomic, see atomic role 
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Boolean operators, see 

Boolean operators on roles 
chain, 102, 103, 234 
closure, 35 

complement, see complement of roles 
composition, see composition of roles 
conjunction, see intersection of roles 
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depth, 89 
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functional, see functional role 
hierarchy, 305, 324, 392, 450 
identity, 127, 195, 338 
inclusion axiom, 55, 224 
intersection, see intersection of roles 
inverse, see inverse role 
in the ER model, 176, 505 
meta, see meta-role 
negation, see complement of roles 
reflexive-transitive closure, see 

reflexive-transitive closure of roles 
restriction, 109 
transitive, see transitive role 
transitive closure, see 

transitive closure of roles 
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universal, see universal role 
role- value-map, 102, 133, 165, 234 
containment, see 

containment role- value- map 
equality, see equality role-value-map 
rule, 318, 424, 425 
ABox, 255 

default, see default rule 
knowledge base, 78 
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trigger, see trigger rule 
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same-as, 103 
SAT, 360 
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w.r.t. finite models, 231 
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Sb-One, 323 

selection on relations, 221, 508 
semantics, 52, 54 

closed-world, see closed-world semantics 
descriptive, see descriptive semantics 
fixpoint, see fixpoint semantics 
greatest fixpoint, see 

greatest fixpoint semantics 
least fixpoint, see least fixpoint semantics 
of rule, 255 
of trigger rule, 77 

open-world, see open-world semantics 
possible worlds, see 

possible worlds semantics 
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semantic data model, 41, 175 

correspondence with Description Logics, 177 
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semistructured data, 186, 512 
semistructured data model, 186 
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Spass, 338 
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structural 

subsumption, 10 
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subclass/superclass edge, 150 
subsume, see subsumption 
subsumption, 9, 68, 92, 414 
subtype/supertype, 175 
support, 157 
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symbolic 
name, 56 

number restriction, 102 
syntactic 

branching, 360 
simplification, 347 
syntactically monotone, 219 
terminology, 63 

tableau-based algorithm, 81, 85 
tag, see XML tag 
Tambis, 453 
TBox, 14, 50, 56 

acyclic, see acyclic TBox 
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Tell&Ask, 13 
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description logic, 258 
logic, 36, 169, 259 
model, 259 
operator, 246, 260 
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terminological 
axiom, 55 
default, 268 
default theory, 268 
system, 3 
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monotone, see monotone terminology 
syntactically monotone, see 

syntactically monotone terminology 
thrashing, 362 
tiling problem, 233, 234 
told subsumer, 354 
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top-down approach, 274 
top concept, 52 
top search, 354, 356 
total attribute, 380 
trace, 119 

technique, 342 
transformation rule, 87 
transitive 

closure of roles, 99, 127, 172 
role, 172, 225, 305 
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tree model property, 90, 202, 227 
trigger rule, 35, 76, 152, 251, 443 
Trilogy, 418 
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UMLS, see Unified Medical Language 
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web-based information system, 28 
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